Overview
Dataset statistics
| Number of variables | 107 |
|---|---|
| Number of observations | 724508 |
| Missing cells | 37354957 |
| Missing cells (%) | 48.2% |
| Total size in memory | 576.9 MiB |
| Average record size in memory | 835.0 B |
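The derived figures in this table can be recomputed from the raw counts. The following is a minimal sketch with the values copied from the table; the variable names are illustrative, and the average record size only matches approximately because the total size is reported to one decimal place.

```python
# Values copied from the "Dataset statistics" table.
n_variables = 107
n_observations = 724_508
missing_cells = 37_354_957
total_size_mib = 576.9  # already rounded in the report

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# 1 MiB = 1024 * 1024 bytes.
avg_record_bytes = total_size_mib * 1024 * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```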
Variable types
| Numeric | 20 |
|---|---|
| Text | 83 |
| Boolean | 4 |
Dataset
| Description | NMNH Paleobiology Specimen Records (USNM) 0049391-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.ws2uf3 |
Alerts
| Alert | Type |
|---|---|
| license has constant value "CC0_1_0" | Constant |
| publisher has constant value "National Museum of Natural History, Smithsonian Institution" | Constant |
| institutionID has constant value "http://biocol.org/urn:lsid:biocol.org:col:34871" | Constant |
| collectionID has constant value "urn:uuid:ce595e88-ceba-42c0-a3ff-cd55b694fac" | Constant |
| institutionCode has constant value "USNM" | Constant |
| collectionCode has constant value "PAL" | Constant |
| datasetName has constant value "NMNH Paleobiology (USNM)" | Constant |
| basisOfRecord has constant value "FOSSIL_SPECIMEN" | Constant |
| occurrenceStatus has constant value "PRESENT" | Constant |
| verbatimCoordinateSystem has constant value "Degrees Minutes Seconds" | Constant |
| datasetKey has constant value "c8681cc2-9d0a-4c5f-b620-5c753abfe2bc" | Constant |
| publishingCountry has constant value "US" | Constant |
| typifiedName has constant value "Type" | Constant |
| protocol has constant value "EML" | Constant |
| lastCrawled has constant value "2024-12-02T10:02:33.848Z" | Constant |
| isSequenced has constant value "False" | Constant |
| publishedByGbifRegion has constant value "NORTH_AMERICA" | Constant |
| hasGeospatialIssues is highly imbalanced (98.1%) | Imbalance |
| catalogNumber has 50535 (7.0%) missing values | Missing |
| recordNumber has 675939 (93.3%) missing values | Missing |
| recordedBy has 563497 (77.8%) missing values | Missing |
| preparations has 591600 (81.7%) missing values | Missing |
| occurrenceRemarks has 638259 (88.1%) missing values | Missing |
| fieldNumber has 720044 (99.4%) missing values | Missing |
| eventDate has 474561 (65.5%) missing values | Missing |
| startDayOfYear has 593923 (82.0%) missing values | Missing |
| endDayOfYear has 593923 (82.0%) missing values | Missing |
| year has 474684 (65.5%) missing values | Missing |
| month has 572740 (79.1%) missing values | Missing |
| day has 596444 (82.3%) missing values | Missing |
| verbatimEventDate has 445814 (61.5%) missing values | Missing |
| locationID has 335037 (46.2%) missing values | Missing |
| higherGeography has 148417 (20.5%) missing values | Missing |
| continent has 195168 (26.9%) missing values | Missing |
| waterBody has 696851 (96.2%) missing values | Missing |
| islandGroup has 723710 (99.9%) missing values | Missing |
| island has 714401 (98.6%) missing values | Missing |
| countryCode has 158422 (21.9%) missing values | Missing |
| stateProvince has 226462 (31.3%) missing values | Missing |
| county has 454433 (62.7%) missing values | Missing |
| locality has 560871 (77.4%) missing values | Missing |
| verbatimElevation has 724311 (> 99.9%) missing values | Missing |
| verbatimDepth has 724424 (> 99.9%) missing values | Missing |
| decimalLatitude has 620570 (85.7%) missing values | Missing |
| decimalLongitude has 620570 (85.7%) missing values | Missing |
| verbatimCoordinateSystem has 654265 (90.3%) missing values | Missing |
| georeferenceProtocol has 695012 (95.9%) missing values | Missing |
| georeferenceRemarks has 724503 (> 99.9%) missing values | Missing |
| earliestEraOrLowestErathem has 220036 (30.4%) missing values | Missing |
| latestEraOrHighestErathem has 718163 (99.1%) missing values | Missing |
| earliestPeriodOrLowestSystem has 245750 (33.9%) missing values | Missing |
| latestPeriodOrHighestSystem has 718167 (99.1%) missing values | Missing |
| earliestEpochOrLowestSeries has 376914 (52.0%) missing values | Missing |
| latestEpochOrHighestSeries has 718290 (99.1%) missing values | Missing |
| earliestAgeOrLowestStage has 562472 (77.6%) missing values | Missing |
| latestAgeOrHighestStage has 722133 (99.7%) missing values | Missing |
| group has 633218 (87.4%) missing values | Missing |
| formation has 365706 (50.5%) missing values | Missing |
| member has 643191 (88.8%) missing values | Missing |
| typeStatus has 582086 (80.3%) missing values | Missing |
| identifiedBy has 521981 (72.0%) missing values | Missing |
| acceptedNameUsageID has 171789 (23.7%) missing values | Missing |
| higherClassification has 172643 (23.8%) missing values | Missing |
| phylum has 192842 (26.6%) missing values | Missing |
| class has 272566 (37.6%) missing values | Missing |
| order has 369296 (51.0%) missing values | Missing |
| family has 258765 (35.7%) missing values | Missing |
| genus has 245070 (33.8%) missing values | Missing |
| genericName has 244897 (33.8%) missing values | Missing |
| specificEpithet has 449718 (62.1%) missing values | Missing |
| infraspecificEpithet has 718207 (99.1%) missing values | Missing |
| taxonomicStatus has 171789 (23.7%) missing values | Missing |
| distanceFromCentroidInMeters has 723864 (99.9%) missing values | Missing |
| mediaType has 637882 (88.0%) missing values | Missing |
| acceptedTaxonKey has 171789 (23.7%) missing values | Missing |
| phylumKey has 192842 (26.6%) missing values | Missing |
| classKey has 272566 (37.6%) missing values | Missing |
| orderKey has 369296 (51.0%) missing values | Missing |
| familyKey has 258765 (35.7%) missing values | Missing |
| genusKey has 245070 (33.8%) missing values | Missing |
| speciesKey has 450165 (62.1%) missing values | Missing |
| species has 450165 (62.1%) missing values | Missing |
| acceptedScientificName has 171789 (23.7%) missing values | Missing |
| verbatimScientificName has 171332 (23.6%) missing values | Missing |
| typifiedName has 724501 (> 99.9%) missing values | Missing |
| repatriated has 158317 (21.9%) missing values | Missing |
| gbifRegion has 160612 (22.2%) missing values | Missing |
| level0Gid has 686240 (94.7%) missing values | Missing |
| level0Name has 686240 (94.7%) missing values | Missing |
| level1Gid has 686243 (94.7%) missing values | Missing |
| level1Name has 686243 (94.7%) missing values | Missing |
| level2Gid has 687320 (94.9%) missing values | Missing |
| level2Name has 687320 (94.9%) missing values | Missing |
| level3Gid has 722506 (99.7%) missing values | Missing |
| level3Name has 722506 (99.7%) missing values | Missing |
| iucnRedListCategory has 365809 (50.5%) missing values | Missing |
| individualCount is highly skewed (γ1 = 32.66226483) | Skewed |
| gbifID has unique values | Unique |
| occurrenceID has unique values | Unique |
| taxonKey has 171789 (23.7%) zeros | Zeros |
| kingdomKey has 171929 (23.7%) zeros | Zeros |
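A column can carry more than one alert: typifiedName, for example, is flagged both as Constant (its only non-missing value is "Type") and as Missing (724501 of 724508 cells are empty). The sketch below illustrates that logic on toy data. It is a simplified illustration, not ydata-profiling's implementation; the function name `classify_column` is hypothetical, and the real tool may apply configurable thresholds that are not reproduced here.

```python
def classify_column(values):
    """Return simplified alert labels for one column; None marks a missing cell."""
    present = [v for v in values if v is not None]
    n_missing = len(values) - len(present)
    n_distinct = len(set(present))

    alerts = []
    # Constant: exactly one distinct value among the non-missing cells.
    if present and n_distinct == 1:
        alerts.append("Constant")
    # Missing: at least one empty cell (no threshold in this sketch).
    if n_missing > 0:
        alerts.append("Missing")
    # Unique: no missing cells and every value differs from all others.
    if present and n_missing == 0 and n_distinct == len(values):
        alerts.append("Unique")
    return alerts


# Toy columns: 7 filled cells mirror typifiedName's 7 non-missing values.
typified_name_like = ["Type"] * 7 + [None] * 20
id_like = ["a1", "b2", "c3"]

print(classify_column(typified_name_like))
print(classify_column(id_like))
```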
Reproduction
| Analysis started | 2025-01-08 21:23:36.996405 |
|---|---|
| Analysis finished | 2025-01-08 21:23:59.935414 |
| Duration | 22.94 seconds |
| Software version | ydata-profiling v4.12.1 |
| Download configuration | config.json |
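The reported duration is the difference between the two timestamps. A small sketch using only the Python standard library, with the timestamps copied from the table:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
started = datetime.fromisoformat("2025-01-08 21:23:36.996405")
finished = datetime.fromisoformat("2025-01-08 21:23:59.935414")

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```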
Variables
gbifID
Real number (ℝ)
Unique 
| Distinct | 724508 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1489761894 |
| Minimum | 1316557246 |
| Maximum | 4987259380 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 1316557246 |
|---|---|
| 5-th percentile | 1316593481 |
| Q1 | 1316738419 |
| median | 1316919584 |
| Q3 | 1317100741 |
| 95-th percentile | 3311023845 |
| Maximum | 4987259380 |
| Range | 3670702134 |
| Interquartile range (IQR) | 362322.5 |
Descriptive statistics
| Standard deviation | 567530383.1 |
|---|---|
| Coefficient of variation (CV) | 0.3809537521 |
| Kurtosis | 11.81732068 |
| Mean | 1489761894 |
| Median Absolute Deviation (MAD) | 181161.5 |
| Skewness | 3.474773969 |
| Sum | 1.07934441 × 10^15 |
| Variance | 3.220907357 × 10^17 |
| Monotonicity | Not monotonic |
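Several of the gbifID statistics are functions of the others, which gives a quick consistency check. The sketch below recomputes them from the reported values; the variable names are illustrative. The interquartile range differs by 0.5 from the reported 362322.5, presumably because the displayed quartiles are rounded to whole numbers.

```python
import math

# Values copied from the gbifID tables.
n = 724_508
minimum, maximum = 1_316_557_246, 4_987_259_380
q1, q3 = 1_316_738_419, 1_317_100_741
mean = 1_489_761_894
std = 567_530_383.1

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range (from rounded quartiles)
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance
total = mean * n                  # Sum

print(value_range, iqr, round(cv, 10))
print(f"{variance:.9e} {total:.8e}")
```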
| Value | Count | Frequency (%) |
| 1316557253 | 1 | < 0.1% |
| 1316984857 | 1 | < 0.1% |
| 1316984394 | 1 | < 0.1% |
| 3311030571 | 1 | < 0.1% |
| 1316984386 | 1 | < 0.1% |
| 1316984362 | 1 | < 0.1% |
| 1316984370 | 1 | < 0.1% |
| 1316984372 | 1 | < 0.1% |
| 1316984383 | 1 | < 0.1% |
| 1316984409 | 1 | < 0.1% |
| Other values (724498) | 724498 |
| Value | Count | Frequency (%) |
| 1316557246 | 1 | |
| 1316557247 | 1 | |
| 1316557248 | 1 | |
| 1316557249 | 1 | |
| 1316557250 | 1 |
| Value | Count | Frequency (%) |
| 4987259380 | 1 | |
| 4987259379 | 1 | |
| 4987259378 | 1 | |
| 4987259377 | 1 | |
| 4987259376 | 1 |
license
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CC0_1_0 |
|---|---|
| 2nd row | CC0_1_0 |
| 3rd row | CC0_1_0 |
| 4th row | CC0_1_0 |
| 5th row | CC0_1_0 |
| Value | Count | Frequency (%) |
| cc0_1_0 | 724508 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 1449016 | |
| 0 | 1449016 | |
| _ | 1449016 | |
| 1 | 724508 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2173524 | |
| Uppercase Letter | 1449016 | |
| Connector Punctuation | 1449016 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1449016 | |
| 1 | 724508 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1449016 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1449016 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3622540 | |
| Latin | 1449016 | 28.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1449016 | |
| _ | 1449016 | |
| 1 | 724508 |
Latin
| Value | Count | Frequency (%) |
| C | 1449016 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5071556 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 1449016 | |
| 0 | 1449016 | |
| _ | 1449016 | |
| 1 | 724508 |
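The category labels in this section ("Uppercase Letter", "Decimal Number", "Connector Punctuation") are Unicode general categories. As a rough sketch, the counts for a constant column can be reproduced by classifying the characters of the single value and multiplying by the number of rows. It uses Python's `unicodedata` module, whose two-letter codes `Lu`, `Nd`, and `Pc` correspond to those three labels; whether the report is computed this way internally is an assumption.

```python
import unicodedata
from collections import Counter

value = "CC0_1_0"   # the constant license value
n_rows = 724_508    # number of observations

# Count general categories within one value, then scale by the row count.
per_value = Counter(unicodedata.category(ch) for ch in value)
totals = {cat: count * n_rows for cat, count in per_value.items()}

for cat, count in sorted(totals.items()):
    print(cat, count)
print("all characters:", sum(totals.values()))
```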
modified
Text
| Distinct | 6008 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Unique
| Unique | 1783 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 2014-11-25T18:32:00Z |
|---|---|
| 2nd row | 2024-10-17T09:58:00Z |
| 3rd row | 2024-10-17T10:44:00Z |
| 4th row | 2024-08-03T21:41:00Z |
| 5th row | 2024-10-17T10:17:00Z |
| Value | Count | Frequency (%) |
| 2024-08-03t22:06:00z | 11077 | 1.5% |
| 2024-08-03t22:09:00z | 9194 | 1.3% |
| 2024-08-03t22:08:00z | 6946 | 1.0% |
| 2024-11-18t11:29:00z | 6500 | 0.9% |
| 2024-11-18t11:28:00z | 6488 | 0.9% |
| 2024-10-17t10:55:00z | 6364 | 0.9% |
| 2024-10-17t10:57:00z | 6355 | 0.9% |
| 2024-10-17t10:29:00z | 6348 | 0.9% |
| 2024-10-17t10:28:00z | 6344 | 0.9% |
| 2024-10-17t10:56:00z | 6343 | 0.9% |
| Other values (5998) | 652549 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3567224 | |
| 1 | 2229486 | |
| 2 | 1840704 | |
| - | 1449016 | |
| : | 1449016 | |
| 4 | 856419 | 5.9% |
| T | 724508 | 5.0% |
| Z | 724508 | 5.0% |
| 7 | 523431 | 3.6% |
| 3 | 323301 | 2.2% |
| Other values (4) | 802547 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10143112 | |
| Dash Punctuation | 1449016 | 10.0% |
| Other Punctuation | 1449016 | 10.0% |
| Uppercase Letter | 1449016 | 10.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3567224 | |
| 1 | 2229486 | |
| 2 | 1840704 | |
| 4 | 856419 | 8.4% |
| 7 | 523431 | 5.2% |
| 3 | 323301 | 3.2% |
| 8 | 267407 | 2.6% |
| 5 | 251997 | 2.5% |
| 9 | 156334 | 1.5% |
| 6 | 126809 | 1.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 724508 | |
| Z | 724508 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1449016 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1449016 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13041144 | |
| Latin | 1449016 | 10.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3567224 | |
| 1 | 2229486 | |
| 2 | 1840704 | |
| - | 1449016 | |
| : | 1449016 | |
| 4 | 856419 | 6.6% |
| 7 | 523431 | 4.0% |
| 3 | 323301 | 2.5% |
| 8 | 267407 | 2.1% |
| 5 | 251997 | 1.9% |
| Other values (2) | 283143 | 2.2% |
Latin
| Value | Count | Frequency (%) |
| T | 724508 | |
| Z | 724508 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14490160 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3567224 | |
| 1 | 2229486 | |
| 2 | 1840704 | |
| - | 1449016 | |
| : | 1449016 | |
| 4 | 856419 | 5.9% |
| T | 724508 | 5.0% |
| Z | 724508 | 5.0% |
| 7 | 523431 | 3.6% |
| 3 | 323301 | 2.2% |
| Other values (4) | 802547 | 5.5% |
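Every modified value is a 20-character UTC timestamp of the form `YYYY-MM-DDTHH:MM:SSZ`, which is why all four length statistics equal 20. The frequency table shows the same values lowercased, presumably a side effect of the report's text processing. A minimal sketch for parsing them with the standard library follows; the helper name `parse_modified` is hypothetical.

```python
from datetime import datetime, timezone

# The five sample rows from the report.
samples = [
    "2014-11-25T18:32:00Z",
    "2024-10-17T09:58:00Z",
    "2024-10-17T10:44:00Z",
    "2024-08-03T21:41:00Z",
    "2024-10-17T10:17:00Z",
]


def parse_modified(text):
    """Parse a 'YYYY-MM-DDTHH:MM:SSZ' string into a timezone-aware datetime."""
    # Upper-casing also accepts the lowercased form seen in the frequency table.
    parsed = datetime.strptime(text.upper(), "%Y-%m-%dT%H:%M:%SZ")
    return parsed.replace(tzinfo=timezone.utc)


parsed = [parse_modified(s) for s in samples]
print(min(parsed).isoformat(), max(parsed).isoformat())
```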
publisher
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 59 |
|---|---|
| Median length | 59 |
| Mean length | 59 |
| Min length | 59 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | National Museum of Natural History, Smithsonian Institution |
|---|---|
| 2nd row | National Museum of Natural History, Smithsonian Institution |
| 3rd row | National Museum of Natural History, Smithsonian Institution |
| 4th row | National Museum of Natural History, Smithsonian Institution |
| 5th row | National Museum of Natural History, Smithsonian Institution |
| Value | Count | Frequency (%) |
| national | 724508 | |
| museum | 724508 | |
| of | 724508 | |
| natural | 724508 | |
| history | 724508 | |
| smithsonian | 724508 | |
| institution | 724508 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 5071556 | |
| i | 4347048 | |
| (space) | 4347048 | |
| a | 3622540 | 8.5% |
| o | 3622540 | 8.5% |
| n | 3622540 | 8.5% |
| s | 2898032 | 6.8% |
| u | 2898032 | 6.8% |
| r | 1449016 | 3.4% |
| m | 1449016 | 3.4% |
| Other values (11) | 9418604 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 33327368 | |
| Space Separator | 4347048 | 10.2% |
| Uppercase Letter | 4347048 | 10.2% |
| Other Punctuation | 724508 | 1.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 5071556 | |
| i | 4347048 | |
| a | 3622540 | |
| o | 3622540 | |
| n | 3622540 | |
| s | 2898032 | |
| u | 2898032 | |
| r | 1449016 | 4.3% |
| m | 1449016 | 4.3% |
| l | 1449016 | 4.3% |
| Other values (4) | 2898032 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1449016 | |
| M | 724508 | |
| H | 724508 | |
| S | 724508 | |
| I | 724508 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4347048 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 724508 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37674416 | |
| Common | 5071556 | 11.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 5071556 | |
| i | 4347048 | |
| a | 3622540 | |
| o | 3622540 | |
| n | 3622540 | |
| s | 2898032 | 7.7% |
| u | 2898032 | 7.7% |
| r | 1449016 | 3.8% |
| m | 1449016 | 3.8% |
| N | 1449016 | 3.8% |
| Other values (9) | 7245080 |
Common
| Value | Count | Frequency (%) |
| (space) | 4347048 | |
| , | 724508 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42745972 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 5071556 | |
| i | 4347048 | |
| (space) | 4347048 | |
| a | 3622540 | 8.5% |
| o | 3622540 | 8.5% |
| n | 3622540 | 8.5% |
| s | 2898032 | 6.8% |
| u | 2898032 | 6.8% |
| r | 1449016 | 3.4% |
| m | 1449016 | 3.4% |
| Other values (11) | 9418604 |
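Because publisher is constant, its word and character tables can be derived from the single 59-character value. The sketch below is an approximation of that derivation: it assumes words are lowercased runs of letters (the report's actual tokenizer is not shown here) and scales per-value character counts by the number of rows.

```python
import re
from collections import Counter

value = "National Museum of Natural History, Smithsonian Institution"
n_rows = 724_508  # number of observations

# Assumed tokenization: lowercase, then split into runs of letters.
words = re.findall(r"[a-z]+", value.lower())

# Character counts are case-sensitive, as in the report ('N' and 'n' differ).
char_counts = Counter(value)
scaled = {ch: count * n_rows for ch, count in char_counts.items()}

print(words)
print("t:", scaled["t"], "space:", scaled[" "], "comma:", scaled[","])
```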
institutionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 47 |
|---|---|
| Median length | 47 |
| Mean length | 47 |
| Min length | 47 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | http://biocol.org/urn:lsid:biocol.org:col:34871 |
|---|---|
| 2nd row | http://biocol.org/urn:lsid:biocol.org:col:34871 |
| 3rd row | http://biocol.org/urn:lsid:biocol.org:col:34871 |
| 4th row | http://biocol.org/urn:lsid:biocol.org:col:34871 |
| 5th row | http://biocol.org/urn:lsid:biocol.org:col:34871 |
| Value | Count | Frequency (%) |
| http://biocol.org/urn:lsid:biocol.org:col:34871 | 724508 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 5071556 | |
| : | 3622540 | 10.6% |
| l | 2898032 | 8.5% |
| r | 2173524 | 6.4% |
| / | 2173524 | 6.4% |
| i | 2173524 | 6.4% |
| c | 2173524 | 6.4% |
| b | 1449016 | 4.3% |
| . | 1449016 | 4.3% |
| t | 1449016 | 4.3% |
| Other values (12) | 9418604 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23184256 | |
| Other Punctuation | 7245080 | 21.3% |
| Decimal Number | 3622540 | 10.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 5071556 | |
| l | 2898032 | |
| r | 2173524 | |
| i | 2173524 | |
| c | 2173524 | |
| b | 1449016 | 6.2% |
| t | 1449016 | 6.2% |
| g | 1449016 | 6.2% |
| d | 724508 | 3.1% |
| h | 724508 | 3.1% |
| Other values (4) | 2898032 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 724508 | |
| 8 | 724508 | |
| 4 | 724508 | |
| 3 | 724508 | |
| 1 | 724508 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 3622540 | |
| / | 2173524 | |
| . | 1449016 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23184256 | |
| Common | 10867620 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 5071556 | |
| l | 2898032 | |
| r | 2173524 | |
| i | 2173524 | |
| c | 2173524 | |
| b | 1449016 | 6.2% |
| t | 1449016 | 6.2% |
| g | 1449016 | 6.2% |
| d | 724508 | 3.1% |
| h | 724508 | 3.1% |
| Other values (4) | 2898032 |
Common
| Value | Count | Frequency (%) |
| : | 3622540 | |
| / | 2173524 | |
| . | 1449016 | 13.3% |
| 7 | 724508 | 6.7% |
| 8 | 724508 | 6.7% |
| 4 | 724508 | 6.7% |
| 3 | 724508 | 6.7% |
| 1 | 724508 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34051876 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 5071556 | |
| : | 3622540 | 10.6% |
| l | 2898032 | 8.5% |
| r | 2173524 | 6.4% |
| / | 2173524 | 6.4% |
| i | 2173524 | 6.4% |
| c | 2173524 | 6.4% |
| b | 1449016 | 4.3% |
| . | 1449016 | 4.3% |
| t | 1449016 | 4.3% |
| Other values (12) | 9418604 |
collectionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 44 |
|---|---|
| Median length | 44 |
| Mean length | 44 |
| Min length | 44 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:uuid:ce595e88-ceba-42c0-a3ff-cd55b694fac |
|---|---|
| 2nd row | urn:uuid:ce595e88-ceba-42c0-a3ff-cd55b694fac |
| 3rd row | urn:uuid:ce595e88-ceba-42c0-a3ff-cd55b694fac |
| 4th row | urn:uuid:ce595e88-ceba-42c0-a3ff-cd55b694fac |
| 5th row | urn:uuid:ce595e88-ceba-42c0-a3ff-cd55b694fac |
| Value | Count | Frequency (%) |
| urn:uuid:ce595e88-ceba-42c0-a3ff-cd55b694fac | 724508 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 3622540 | 11.4% |
| - | 2898032 | 9.1% |
| 5 | 2898032 | 9.1% |
| u | 2173524 | 6.8% |
| f | 2173524 | 6.8% |
| a | 2173524 | 6.8% |
| e | 2173524 | 6.8% |
| 4 | 1449016 | 4.5% |
| b | 1449016 | 4.5% |
| 8 | 1449016 | 4.5% |
| Other values (10) | 9418604 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17388192 | |
| Decimal Number | 10143112 | |
| Dash Punctuation | 2898032 | 9.1% |
| Other Punctuation | 1449016 | 4.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 3622540 | |
| u | 2173524 | |
| f | 2173524 | |
| a | 2173524 | |
| e | 2173524 | |
| b | 1449016 | 8.3% |
| d | 1449016 | 8.3% |
| r | 724508 | 4.2% |
| i | 724508 | 4.2% |
| n | 724508 | 4.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 2898032 | |
| 4 | 1449016 | |
| 8 | 1449016 | |
| 9 | 1449016 | |
| 2 | 724508 | 7.1% |
| 0 | 724508 | 7.1% |
| 3 | 724508 | 7.1% |
| 6 | 724508 | 7.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2898032 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1449016 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17388192 | |
| Common | 14490160 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 3622540 | |
| u | 2173524 | |
| f | 2173524 | |
| a | 2173524 | |
| e | 2173524 | |
| b | 1449016 | 8.3% |
| d | 1449016 | 8.3% |
| r | 724508 | 4.2% |
| i | 724508 | 4.2% |
| n | 724508 | 4.2% |
Common
| Value | Count | Frequency (%) |
| - | 2898032 | |
| 5 | 2898032 | |
| 4 | 1449016 | |
| 8 | 1449016 | |
| 9 | 1449016 | |
| : | 1449016 | |
| 2 | 724508 | 5.0% |
| 0 | 724508 | 5.0% |
| 3 | 724508 | 5.0% |
| 6 | 724508 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31878352 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 3622540 | 11.4% |
| - | 2898032 | 9.1% |
| 5 | 2898032 | 9.1% |
| u | 2173524 | 6.8% |
| f | 2173524 | 6.8% |
| a | 2173524 | 6.8% |
| e | 2173524 | 6.8% |
| 4 | 1449016 | 4.5% |
| b | 1449016 | 4.5% |
| 8 | 1449016 | 4.5% |
| Other values (10) | 9418604 |
institutionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | USNM |
|---|---|
| 2nd row | USNM |
| 3rd row | USNM |
| 4th row | USNM |
| 5th row | USNM |
| Value | Count | Frequency (%) |
| usnm | 724508 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 724508 | |
| S | 724508 | |
| N | 724508 | |
| M | 724508 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2898032 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 724508 | |
| S | 724508 | |
| N | 724508 | |
| M | 724508 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2898032 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 724508 | |
| S | 724508 | |
| N | 724508 | |
| M | 724508 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2898032 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 724508 | |
| S | 724508 | |
| N | 724508 | |
| M | 724508 |
collectionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PAL |
|---|---|
| 2nd row | PAL |
| 3rd row | PAL |
| 4th row | PAL |
| 5th row | PAL |
| Value | Count | Frequency (%) |
| pal | 724508 |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 724508 | |
| A | 724508 | |
| L | 724508 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2173524 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 724508 | |
| A | 724508 | |
| L | 724508 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2173524 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 724508 | |
| A | 724508 | |
| L | 724508 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2173524 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 724508 | |
| A | 724508 | |
| L | 724508 |
datasetName
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 24 |
| Min length | 24 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NMNH Paleobiology (USNM) |
|---|---|
| 2nd row | NMNH Paleobiology (USNM) |
| 3rd row | NMNH Paleobiology (USNM) |
| 4th row | NMNH Paleobiology (USNM) |
| 5th row | NMNH Paleobiology (USNM) |
| Value | Count | Frequency (%) |
| nmnh | 724508 | |
| paleobiology | 724508 | |
| usnm | 724508 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 2173524 | |
| o | 2173524 | |
| (space) | 1449016 | 8.3% |
| l | 1449016 | 8.3% |
| M | 1449016 | 8.3% |
| H | 724508 | 4.2% |
| P | 724508 | 4.2% |
| a | 724508 | 4.2% |
| e | 724508 | 4.2% |
| b | 724508 | 4.2% |
| Other values (7) | 5071556 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7969588 | |
| Uppercase Letter | 6520572 | |
| Space Separator | 1449016 | 8.3% |
| Open Punctuation | 724508 | 4.2% |
| Close Punctuation | 724508 | 4.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 2173524 | |
| l | 1449016 | |
| a | 724508 | 9.1% |
| e | 724508 | 9.1% |
| b | 724508 | 9.1% |
| i | 724508 | 9.1% |
| g | 724508 | 9.1% |
| y | 724508 | 9.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2173524 | |
| M | 1449016 | |
| H | 724508 | 11.1% |
| P | 724508 | 11.1% |
| U | 724508 | 11.1% |
| S | 724508 | 11.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1449016 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 724508 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 724508 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14490160 | |
| Common | 2898032 | 16.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 2173524 | |
| o | 2173524 | |
| l | 1449016 | |
| M | 1449016 | |
| H | 724508 | 5.0% |
| P | 724508 | 5.0% |
| a | 724508 | 5.0% |
| e | 724508 | 5.0% |
| b | 724508 | 5.0% |
| i | 724508 | 5.0% |
| Other values (4) | 2898032 |
Common
| Value | Count | Frequency (%) |
| (space) | 1449016 | |
| ( | 724508 | |
| ) | 724508 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17388192 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 2173524 | |
| o | 2173524 | |
| (space) | 1449016 | 8.3% |
| l | 1449016 | 8.3% |
| M | 1449016 | 8.3% |
| H | 724508 | 4.2% |
| P | 724508 | 4.2% |
| a | 724508 | 4.2% |
| e | 724508 | 4.2% |
| b | 724508 | 4.2% |
| Other values (7) | 5071556 |
basisOfRecord
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 15 |
| Min length | 15 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | FOSSIL_SPECIMEN |
|---|---|
| 2nd row | FOSSIL_SPECIMEN |
| 3rd row | FOSSIL_SPECIMEN |
| 4th row | FOSSIL_SPECIMEN |
| 5th row | FOSSIL_SPECIMEN |
| Value | Count | Frequency (%) |
| fossil_specimen | 724508 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 2173524 | |
| I | 1449016 | |
| E | 1449016 | |
| F | 724508 | 6.7% |
| O | 724508 | 6.7% |
| L | 724508 | 6.7% |
| _ | 724508 | 6.7% |
| P | 724508 | 6.7% |
| C | 724508 | 6.7% |
| M | 724508 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 10143112 | |
| Connector Punctuation | 724508 | 6.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2173524 | |
| I | 1449016 | |
| E | 1449016 | |
| F | 724508 | 7.1% |
| O | 724508 | 7.1% |
| L | 724508 | 7.1% |
| P | 724508 | 7.1% |
| C | 724508 | 7.1% |
| M | 724508 | 7.1% |
| N | 724508 | 7.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 724508 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10143112 | |
| Common | 724508 | 6.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 2173524 | |
| I | 1449016 | |
| E | 1449016 | |
| F | 724508 | 7.1% |
| O | 724508 | 7.1% |
| L | 724508 | 7.1% |
| P | 724508 | 7.1% |
| C | 724508 | 7.1% |
| M | 724508 | 7.1% |
| N | 724508 | 7.1% |
Common
| Value | Count | Frequency (%) |
| _ | 724508 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10867620 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 2173524 | |
| I | 1449016 | |
| E | 1449016 | |
| F | 724508 | 6.7% |
| O | 724508 | 6.7% |
| L | 724508 | 6.7% |
| _ | 724508 | 6.7% |
| P | 724508 | 6.7% |
| C | 724508 | 6.7% |
| M | 724508 | 6.7% |
occurrenceID
Text
Unique 
| Distinct | 724508 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 63 |
| Mean length | 63 |
| Min length | 63 |
Unique
| Unique | 724508 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | http://n2t.net/ark:/65665/300009e1e-4f3e-4240-b198-9ea1352b28b5 |
|---|---|
| 2nd row | http://n2t.net/ark:/65665/30000a59d-34e5-42b6-837d-ad1b89b6b930 |
| 3rd row | http://n2t.net/ark:/65665/3000109b9-b6d6-4ca0-8f0c-ddde53458300 |
| 4th row | http://n2t.net/ark:/65665/30001bcd8-61d5-492a-ad56-f8131f24bdaa |
| 5th row | http://n2t.net/ark:/65665/300020a6b-970f-4e44-adb4-6d605be80b0d |
| Value | Count | Frequency (%) |
| http://n2t.net/ark:/65665/300009e1e-4f3e-4240-b198-9ea1352b28b5 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3004266bd-f222-4227-9817-5905ac4cbc57 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/30011b937-0eb9-4c75-bea7-c27393598b76 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3002cb891-3b1b-49d8-84ee-8558aba9bf13 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3000a6387-0469-4278-8ac0-fb0ac6fd37d6 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3000109b9-b6d6-4ca0-8f0c-ddde53458300 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/30001bcd8-61d5-492a-ad56-f8131f24bdaa | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300020a6b-970f-4e44-adb4-6d605be80b0d | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300045523-2307-4a34-b888-fb51510870ad | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300045db2-681e-481a-836e-3643bf3debbf | 1 | < 0.1% |
| Other values (724498) | 724498 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 3622540 | 7.9% |
| 6 | 3531516 | 7.7% |
| - | 2898032 | 6.3% |
| t | 2898032 | 6.3% |
| 5 | 2808306 | 6.2% |
| a | 2263386 | 5.0% |
| e | 2084462 | 4.6% |
| 2 | 2083197 | 4.6% |
| 3 | 2083153 | 4.6% |
| 4 | 2081137 | 4.6% |
| Other values (16) | 19290243 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 19743301 | |
| Lowercase Letter | 17206607 | |
| Other Punctuation | 5796064 | 12.7% |
| Dash Punctuation | 2898032 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2898032 | |
| a | 2263386 | |
| e | 2084462 | |
| b | 1539404 | |
| n | 1449016 | |
| c | 1358538 | |
| d | 1358025 | |
| f | 1357712 | |
| k | 724508 | 4.2% |
| r | 724508 | 4.2% |
| Other values (2) | 1449016 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 3531516 | |
| 5 | 2808306 | |
| 2 | 2083197 | |
| 3 | 2083153 | |
| 4 | 2081137 | |
| 8 | 1539173 | |
| 9 | 1539102 | |
| 0 | 1359375 | 6.9% |
| 7 | 1359374 | 6.9% |
| 1 | 1358968 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 3622540 | |
| : | 1449016 | 25.0% |
| . | 724508 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2898032 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 28437397 | |
| Latin | 17206607 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 3622540 | |
| 6 | 3531516 | |
| - | 2898032 | |
| 5 | 2808306 | |
| 2 | 2083197 | |
| 3 | 2083153 | |
| 4 | 2081137 | |
| 8 | 1539173 | 5.4% |
| 9 | 1539102 | 5.4% |
| : | 1449016 | 5.1% |
| Other values (4) | 4802225 |
Latin
| Value | Count | Frequency (%) |
| t | 2898032 | |
| a | 2263386 | |
| e | 2084462 | |
| b | 1539404 | |
| n | 1449016 | |
| c | 1358538 | |
| d | 1358025 | |
| f | 1357712 | |
| k | 724508 | 4.2% |
| r | 724508 | 4.2% |
| Other values (2) | 1449016 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 45644004 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 3622540 | 7.9% |
| 6 | 3531516 | 7.7% |
| - | 2898032 | 6.3% |
| t | 2898032 | 6.3% |
| 5 | 2808306 | 6.2% |
| a | 2263386 | 5.0% |
| e | 2084462 | 4.6% |
| 2 | 2083197 | 4.6% |
| 3 | 2083153 | 4.6% |
| 4 | 2081137 | 4.6% |
| Other values (16) | 19290243 |
catalogNumber
Text
Missing 
| Distinct | 655081 |
|---|---|
| Distinct (%) | 97.2% |
| Missing | 50535 |
| Missing (%) | 7.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 14 |
| Mean length | 13.86868317 |
| Min length | 7 |
Unique
| Unique | 638257 |
|---|---|
| Unique (%) | 94.7% |
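The percentages in this summary appear to use two different denominators: Missing (%) is relative to all 724508 observations, while Distinct (%) and Unique (%) match the counts divided by the non-missing rows only. A minimal Python sketch that reproduces the figures above from the reported counts (the helper name `pct` is illustrative and not part of the profiling tool):

```python
# Counts copied from the catalogNumber summary and the dataset overview.
n_rows = 724508
n_missing = 50535
n_distinct = 655081
n_unique = 638257

def pct(part, whole):
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100.0 * part / whole, 1)

n_present = n_rows - n_missing             # rows that have a catalogNumber
missing_pct = pct(n_missing, n_rows)       # share of all rows
distinct_pct = pct(n_distinct, n_present)  # share of non-missing rows
unique_pct = pct(n_unique, n_present)      # share of non-missing rows

print(missing_pct, distinct_pct, unique_pct)
```

The same relationship holds for the other text variables in this report, for example recordNumber, where 39872 distinct values over 48569 non-missing rows gives the reported 82.1%.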
Sample
| 1st row | USNM SD38013 0000 |
|---|---|
| 2nd row | USNM PAL706968 |
| 3rd row | USNM PAL248638 |
| 4th row | USNM PAL456768 |
| 5th row | USNM PAL297724 |
| Value | Count | Frequency (%) |
| usnm | 673973 | |
| 0000 | 59177 | 4.2% |
| 0002 | 159 | < 0.1% |
| 0001 | 159 | < 0.1% |
| 0003 | 149 | < 0.1% |
| 0004 | 145 | < 0.1% |
| 0005 | 137 | < 0.1% |
| 0006 | 116 | < 0.1% |
| 0007 | 113 | < 0.1% |
| 0008 | 105 | < 0.1% |
| Other values (652937) | 674632 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 742844 | 7.9% |
| (space) | 734892 | 7.9% |
| M | 712585 | 7.6% |
| N | 674519 | 7.2% |
| U | 674214 | 7.2% |
| 0 | 557394 | 6.0% |
| P | 521957 | 5.6% |
| A | 511374 | 5.5% |
| L | 497601 | 5.3% |
| 1 | 444334 | 4.8% |
| Other values (58) | 3275404 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4546936 | |
| Decimal Number | 4063828 | |
| Space Separator | 734892 | 7.9% |
| Other Punctuation | 741 | < 0.1% |
| Lowercase Letter | 690 | < 0.1% |
| Dash Punctuation | 30 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 742844 | |
| M | 712585 | |
| N | 674519 | |
| U | 674214 | |
| P | 521957 | |
| A | 511374 | |
| L | 497601 | |
| D | 65264 | 1.4% |
| C | 43992 | 1.0% |
| O | 38427 | 0.8% |
| Other values (16) | 64159 | 1.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 130 | |
| b | 126 | |
| d | 61 | |
| e | 54 | |
| c | 50 | 7.2% |
| o | 38 | 5.5% |
| l | 31 | 4.5% |
| f | 27 | 3.9% |
| r | 26 | 3.8% |
| k | 23 | 3.3% |
| Other values (16) | 124 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 557394 | |
| 1 | 444334 | |
| 3 | 432709 | |
| 5 | 423320 | |
| 2 | 419515 | |
| 4 | 412173 | |
| 6 | 395612 | |
| 7 | 350867 | |
| 8 | 318934 | |
| 9 | 308970 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 704 | |
| " | 35 | 4.7% |
| , | 2 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 734892 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 30 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4799492 | |
| Latin | 4547626 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 742844 | |
| M | 712585 | |
| N | 674519 | |
| U | 674214 | |
| P | 521957 | |
| A | 511374 | |
| L | 497601 | |
| D | 65264 | 1.4% |
| C | 43992 | 1.0% |
| O | 38427 | 0.8% |
| Other values (42) | 64849 | 1.4% |
Common
| Value | Count | Frequency (%) |
| (space) | 734892 | |
| 0 | 557394 | |
| 1 | 444334 | |
| 3 | 432709 | |
| 5 | 423320 | |
| 2 | 419515 | |
| 4 | 412173 | |
| 6 | 395612 | |
| 7 | 350867 | |
| 8 | 318934 | |
| Other values (6) | 309742 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9347118 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 742844 | 7.9% |
| (space) | 734892 | 7.9% |
| M | 712585 | 7.6% |
| N | 674519 | 7.2% |
| U | 674214 | 7.2% |
| 0 | 557394 | 6.0% |
| P | 521957 | 5.6% |
| A | 511374 | 5.5% |
| L | 497601 | 5.3% |
| 1 | 444334 | 4.8% |
| Other values (58) | 3275404 |
recordNumber
Text
Missing 
| Distinct | 39872 |
|---|---|
| Distinct (%) | 82.1% |
| Missing | 675939 |
| Missing (%) | 93.3% |
| Memory size | 5.5 MiB |
Length
| Max length | 48 |
|---|---|
| Median length | 5 |
| Mean length | 6.205336737 |
| Min length | 1 |
Unique
| Unique | 37721 |
|---|---|
| Unique (%) | 77.7% |
Sample
| 1st row | PALMER LOC 1479 |
|---|---|
| 2nd row | 75432 |
| 3rd row | H-11 |
| 4th row | E73-59 |
| 5th row | Gaxin Loc 178-36 |
| Value | Count | Frequency (%) |
| loc | 1685 | 2.9% |
| emlong | 951 | 1.7% |
| urbac | 803 | 1.4% |
| olson | 263 | 0.5% |
| sample | 209 | 0.4% |
| hass | 177 | 0.3% |
| rb | 171 | 0.3% |
| c-29 | 169 | 0.3% |
| gibson | 163 | 0.3% |
| wyo | 162 | 0.3% |
| Other values (38506) | 52476 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 30021 | 10.0% |
| 5 | 27939 | 9.3% |
| 7 | 23690 | 7.9% |
| 2 | 21570 | 7.2% |
| 3 | 20657 | 6.9% |
| 6 | 18998 | 6.3% |
| 8 | 18791 | 6.2% |
| 0 | 17388 | 5.8% |
| 4 | 17006 | 5.6% |
| - | 16559 | 5.5% |
| Other values (67) | 88768 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 211386 | |
| Uppercase Letter | 58763 | 19.5% |
| Dash Punctuation | 16559 | 5.5% |
| Space Separator | 8660 | 2.9% |
| Other Punctuation | 3199 | 1.1% |
| Lowercase Letter | 2471 | 0.8% |
| Math Symbol | 145 | < 0.1% |
| Close Punctuation | 102 | < 0.1% |
| Open Punctuation | 101 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 5593 | 9.5% |
| E | 4986 | 8.5% |
| L | 4981 | 8.5% |
| C | 4891 | 8.3% |
| S | 4262 | 7.3% |
| A | 4151 | 7.1% |
| M | 3190 | 5.4% |
| R | 3078 | 5.2% |
| N | 3020 | 5.1% |
| B | 2373 | 4.0% |
| Other values (16) | 18238 |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 425 | |
| n | 315 | |
| a | 217 | |
| y | 190 | |
| l | 189 | |
| c | 189 | |
| e | 172 | |
| i | 169 | 6.8% |
| r | 167 | 6.8% |
| t | 82 | 3.3% |
| Other values (14) | 356 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 30021 | |
| 5 | 27939 | |
| 7 | 23690 | |
| 2 | 21570 | |
| 3 | 20657 | |
| 6 | 18998 | |
| 8 | 18791 | |
| 0 | 17388 | |
| 4 | 17006 | |
| 9 | 15326 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1630 | |
| . | 955 | |
| , | 516 | 16.1% |
| ? | 56 | 1.8% |
| ' | 22 | 0.7% |
| ; | 12 | 0.4% |
| # | 5 | 0.2% |
| : | 2 | 0.1% |
| & | 1 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 135 | |
| = | 10 | 6.9% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 100 | |
| } | 2 | 2.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 16559 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 8660 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 101 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 240153 | |
| Latin | 61234 | 20.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| O | 5593 | 9.1% |
| E | 4986 | 8.1% |
| L | 4981 | 8.1% |
| C | 4891 | 8.0% |
| S | 4262 | 7.0% |
| A | 4151 | 6.8% |
| M | 3190 | 5.2% |
| R | 3078 | 5.0% |
| N | 3020 | 4.9% |
| B | 2373 | 3.9% |
| Other values (40) | 20709 |
Common
| Value | Count | Frequency (%) |
| 1 | 30021 | |
| 5 | 27939 | |
| 7 | 23690 | |
| 2 | 21570 | |
| 3 | 20657 | |
| 6 | 18998 | |
| 8 | 18791 | |
| 0 | 17388 | |
| 4 | 17006 | |
| - | 16559 | |
| Other values (17) | 27534 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 301387 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 30021 | 10.0% |
| 5 | 27939 | 9.3% |
| 7 | 23690 | 7.9% |
| 2 | 21570 | 7.2% |
| 3 | 20657 | 6.9% |
| 6 | 18998 | 6.3% |
| 8 | 18791 | 6.2% |
| 0 | 17388 | 5.8% |
| 4 | 17006 | 5.6% |
| - | 16559 | 5.5% |
| Other values (67) | 88768 |
recordedBy
Text
Missing 
| Distinct | 3957 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 563497 |
| Missing (%) | 77.8% |
| Memory size | 5.5 MiB |
Length
| Max length | 119 |
|---|---|
| Median length | 61 |
| Mean length | 10.93147052 |
| Min length | 1 |
Unique
| Unique | 1329 |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | R. Snow |
|---|---|
| 2nd row | D. Palmer |
| 3rd row | W. Woodring & L. Lupher |
| 4th row | James |
| 5th row | Ross |
| Value | Count | Frequency (%) |
| & | 21228 | 6.1% |
| j | 19727 | 5.7% |
| r | 15376 | 4.5% |
| w | 14249 | 4.1% |
| a | 12060 | 3.5% |
| james | 11468 | 3.3% |
| l | 10757 | 3.1% |
| woodring | 9356 | 2.7% |
| pribyl | 8943 | 2.6% |
| c | 7362 | 2.1% |
| Other values (2560) | 214833 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 184348 | 10.5% |
| e | 133592 | 7.6% |
| . | 131492 | 7.5% |
| r | 102132 | 5.8% |
| o | 91217 | 5.2% |
| l | 89319 | 5.1% |
| n | 89079 | 5.1% |
| a | 84651 | 4.8% |
| i | 80231 | 4.6% |
| s | 70452 | 4.0% |
| Other values (51) | 703574 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1075097 | |
| Uppercase Letter | 337569 | 19.2% |
| Space Separator | 184348 | 10.5% |
| Other Punctuation | 160539 | 9.1% |
| Dash Punctuation | 2462 | 0.1% |
| Open Punctuation | 36 | < 0.1% |
| Close Punctuation | 36 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 133592 | |
| r | 102132 | |
| o | 91217 | 8.5% |
| l | 89319 | 8.3% |
| n | 89079 | 8.3% |
| a | 84651 | 7.9% |
| i | 80231 | 7.5% |
| s | 70452 | 6.6% |
| t | 48464 | 4.5% |
| d | 48173 | 4.5% |
| Other values (18) | 237787 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 36000 | 10.7% |
| W | 33626 | 10.0% |
| A | 27177 | 8.1% |
| R | 24357 | 7.2% |
| P | 20822 | 6.2% |
| C | 20595 | 6.1% |
| M | 19813 | 5.9% |
| S | 19479 | 5.8% |
| L | 18797 | 5.6% |
| H | 15162 | 4.5% |
| Other values (15) | 101741 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 131492 | |
| & | 21228 | 13.2% |
| , | 7789 | 4.9% |
| ' | 30 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 184348 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2462 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 36 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 36 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1412666 | |
| Common | 347421 | 19.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 133592 | 9.5% |
| r | 102132 | 7.2% |
| o | 91217 | 6.5% |
| l | 89319 | 6.3% |
| n | 89079 | 6.3% |
| a | 84651 | 6.0% |
| i | 80231 | 5.7% |
| s | 70452 | 5.0% |
| t | 48464 | 3.4% |
| d | 48173 | 3.4% |
| Other values (43) | 575356 |
Common
| Value | Count | Frequency (%) |
| (space) | 184348 | |
| . | 131492 | |
| & | 21228 | 6.1% |
| , | 7789 | 2.2% |
| - | 2462 | 0.7% |
| ( | 36 | < 0.1% |
| ) | 36 | < 0.1% |
| ' | 30 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1760046 | |
| None | 41 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 184348 | 10.5% |
| e | 133592 | 7.6% |
| . | 131492 | 7.5% |
| r | 102132 | 5.8% |
| o | 91217 | 5.2% |
| l | 89319 | 5.1% |
| n | 89079 | 5.1% |
| a | 84651 | 4.8% |
| i | 80231 | 4.6% |
| s | 70452 | 4.0% |
| Other values (49) | 703533 |
None
| Value | Count | Frequency (%) |
| ú | 40 | |
| č | 1 | 2.4% |
individualCount
Real number (ℝ)
Skewed 
| Distinct | 686 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 303 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.84197706 |
| Minimum | 0 |
|---|---|
| Maximum | 15000 |
| Zeros | 158 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 16 |
| Maximum | 15000 |
| Range | 15000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 133.8531974 |
|---|---|
| Coefficient of variation (CV) | 11.30328126 |
| Kurtosis | 1553.85833 |
| Mean | 11.84197706 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 32.66226483 |
| Sum | 8576019 |
| Variance | 17916.67846 |
| Monotonicity | Not monotonic |
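The descriptive statistics above are mutually consistent and can be cross-checked with a few lines of arithmetic. The Python sketch below recomputes the coefficient of variation, the variance, and the mean from the reported values; the tolerance is an arbitrary choice for illustration.

```python
import math

# Values copied from the individualCount tables above.
mean = 11.84197706
std = 133.8531974
variance = 17916.67846
total = 8576019
n_rows = 724508
n_missing = 303

cv = std / mean                  # coefficient of variation
n_present = n_rows - n_missing   # rows contributing to the mean

# Each reported figure should agree with its definition up to rounding.
assert math.isclose(cv, 11.30328126, rel_tol=1e-6)
assert math.isclose(std ** 2, variance, rel_tol=1e-6)
assert math.isclose(total / n_present, mean, rel_tol=1e-6)
```

A coefficient of variation above 11, together with a median and both quartiles equal to 1, is what drives the "Skewed" flag on this variable: a small number of very large lot counts dominates the mean.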
| Value | Count | Frequency (%) |
| 1 | 594864 | |
| 2 | 29629 | 4.1% |
| 3 | 14673 | 2.0% |
| 4 | 9858 | 1.4% |
| 5 | 7420 | 1.0% |
| 6 | 5780 | 0.8% |
| 7 | 4510 | 0.6% |
| 8 | 3695 | 0.5% |
| 10 | 3151 | 0.4% |
| 9 | 3129 | 0.4% |
| Other values (676) | 47496 | 6.6% |
| Value | Count | Frequency (%) |
| 0 | 158 | < 0.1% |
| 1 | 594864 | |
| 2 | 29629 | 4.1% |
| 3 | 14673 | 2.0% |
| 4 | 9858 | 1.4% |
| Value | Count | Frequency (%) |
| 15000 | 1 | < 0.1% |
| 9999 | 2 | < 0.1% |
| 9942 | 1 | < 0.1% |
| 9000 | 8 | |
| 8000 | 5 |
occurrenceStatus
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESENT |
|---|---|
| 2nd row | PRESENT |
| 3rd row | PRESENT |
| 4th row | PRESENT |
| 5th row | PRESENT |
| Value | Count | Frequency (%) |
| present | 724508 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1449016 | |
| P | 724508 | |
| R | 724508 | |
| S | 724508 | |
| N | 724508 | |
| T | 724508 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 5071556 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1449016 | |
| P | 724508 | |
| R | 724508 | |
| S | 724508 | |
| N | 724508 | |
| T | 724508 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5071556 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 1449016 | |
| P | 724508 | |
| R | 724508 | |
| S | 724508 | |
| N | 724508 | |
| T | 724508 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5071556 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 1449016 | |
| P | 724508 | |
| R | 724508 | |
| S | 724508 | |
| N | 724508 | |
| T | 724508 |
preparations
Text
Missing 
| Distinct | 381 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 591600 |
| Missing (%) | 81.7% |
| Memory size | 5.5 MiB |
Length
| Max length | 94 |
|---|---|
| Median length | 91 |
| Mean length | 16.14684594 |
| Min length | 3 |
Unique
| Unique | 130 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Boxes and vials |
|---|---|
| 2nd row | Thin sections |
| 3rd row | Secondary microslides |
| 4th row | Wet |
| 5th row | plastic container |
| Value | Count | Frequency (%) |
| microslide | 45697 | |
| microslides | 34837 | |
| secondary | 33230 | |
| remnants | 26629 | |
| thin | 24547 | |
| sections | 24011 | |
| no | 15071 | 5.8% |
| with | 10919 | 4.2% |
| unsectioned | 9109 | 3.5% |
| bottle | 3934 | 1.5% |
| Other values (53) | 32636 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 236706 | |
| s | 211809 | |
| e | 210870 | |
| n | 172401 | 8.0% |
| o | 167894 | 7.8% |
| c | 147453 | 6.9% |
| r | 146905 | 6.8% |
| d | 130804 | 6.1% |
| (space) | 127712 | 6.0% |
| l | 92477 | 4.3% |
| Other values (41) | 501014 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1849130 | |
| Uppercase Letter | 159097 | 7.4% |
| Space Separator | 127712 | 6.0% |
| Other Punctuation | 10096 | 0.5% |
| Open Punctuation | 5 | < 0.1% |
| Close Punctuation | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 236706 | |
| s | 211809 | |
| e | 210870 | |
| n | 172401 | |
| o | 167894 | |
| c | 147453 | |
| r | 146905 | |
| d | 130804 | |
| l | 92477 | 5.0% |
| t | 85481 | 4.6% |
| Other values (14) | 246330 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 46146 | |
| S | 38065 | |
| T | 27401 | |
| U | 10261 | 6.4% |
| B | 6095 | 3.8% |
| P | 5926 | 3.7% |
| C | 5880 | 3.7% |
| O | 5094 | 3.2% |
| E | 3082 | 1.9% |
| R | 2197 | 1.4% |
| Other values (11) | 8950 | 5.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 9850 | |
| & | 157 | 1.6% |
| / | 89 | 0.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 127712 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2008227 | |
| Common | 137818 | 6.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 236706 | |
| s | 211809 | |
| e | 210870 | |
| n | 172401 | |
| o | 167894 | |
| c | 147453 | 7.3% |
| r | 146905 | 7.3% |
| d | 130804 | 6.5% |
| l | 92477 | 4.6% |
| t | 85481 | 4.3% |
| Other values (35) | 405427 |
Common
| Value | Count | Frequency (%) |
| (space) | 127712 | |
| ; | 9850 | 7.1% |
| & | 157 | 0.1% |
| / | 89 | 0.1% |
| ( | 5 | < 0.1% |
| ) | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2146045 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 236706 | |
| s | 211809 | |
| e | 210870 | |
| n | 172401 | 8.0% |
| o | 167894 | 7.8% |
| c | 147453 | 6.9% |
| r | 146905 | 6.8% |
| d | 130804 | 6.1% |
| (space) | 127712 | 6.0% |
| l | 92477 | 4.3% |
| Other values (41) | 501014 |
occurrenceRemarks
Text
Missing
| Distinct | 38195 |
|---|---|
| Distinct (%) | 44.3% |
| Missing | 638259 |
| Missing (%) | 88.1% |
| Memory size | 5.5 MiB |
Length
| Max length | 1257 |
|---|---|
| Median length | 1240 |
| Mean length | 357.4557966 |
| Min length | 5 |
Unique
| Unique | 36384 |
|---|---|
| Unique (%) | 42.2% |
Sample
| 1st row | Specimen comments: Associated w/ #0343 and #0346. \| Body size code: medium; Taphonomic Significance: Human modification \| Features: Weathering, diagenesis: N/A; Burn Color: none; Burn Modification: none; Cut: 0; Scrape: 0; Chop: 0; Loading Notch: 0; Counterblow: 0; Anvil pit: 0; Carn pit: 0; Carn score: 0; Carn furrow: 0; Carn punct: 0; Carn crenulation: 0; Rodent gnaw: none |
|---|---|
| 2nd row | EMu record was created as part of the Smithsonian Institution Digitization Program Office (SI DPO) mass digitization pilot project to support the National Science Foundation Advancing Digitization of Biodiversity Collections Eastern Pacific Invertebrates of the Cenozoic Collaborative Thematic Collections Network (NSF ADBC EPICC TCN). The SI DPO mass digitization pilot workflow includes crowdsourced label transcription through the SI Transcription Center.; Information generated by NMNH Department of Paleobiology volunteers: Specimen count and preliminary identification to class. |
| 3rd row | EMu record was created as part of the Smithsonian Institution Digitization Program Office (SI DPO) mass digitization pilot project to support the National Science Foundation Advancing Digitization of Biodiversity Collections Eastern Pacific Invertebrates of the Cenozoic Collaborative Thematic Collections Network (NSF ADBC EPICC TCN). The SI DPO mass digitization pilot workflow includes crowdsourced label transcription through the SI Transcription Center.; Information generated by NMNH Department of Paleobiology volunteers: Specimen count and preliminary identification to class. |
| 4th row | The fossil is marked with the original Green River number and is often mistaken for the USNM number. That original Green River collection number is 75432.; Numbers associated with this fossil: 578683. 75432. 40193. |
| 5th row | EMu record was created as part of the Smithsonian Institution Digitization Program Office (SI DPO) mass digitization pilot project to support the National Science Foundation Advancing Digitization of Biodiversity Collections Eastern Pacific Invertebrates of the Cenozoic Collaborative Thematic Collections Network (NSF ADBC EPICC TCN). The SI DPO mass digitization pilot workflow includes crowdsourced label transcription through the SI Transcription Center.; Additional label information: This locality is at approximately the same horizon as USGS CENO LOC 5686, in which a shale fauna was collected \| See USGS CENO LOC 5703; Verbatim Lithostratigraphy: Tejon Formation; Sandstone forming the upper member of the Tejon \| Discontinuous lenses in a soft brownish sandstone, less than 100 feet stratigraphically below the overlying diatomaceous shale; Verbatim Chronostratigraphy: Eocene |
| Value | Count | Frequency (%) |
| the | 291111 | 6.9% |
| digitization | 174338 | 4.1% |
| of | 164357 | 3.9% |
| si | 100203 | 2.4% |
| collections | 99405 | 2.4% |
| number | 86263 | 2.0% |
| is | 85833 | 2.0% |
| mass | 74949 | 1.8% |
| dpo | 74947 | 1.8% |
| with | 57325 | 1.4% |
| Other values (66970) | 3009589 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 4132071 | 13.4% |
| i | 2608470 | 8.5% |
| t | 2311910 | 7.5% |
| o | 2139574 | 6.9% |
| e | 2129723 | 6.9% |
| n | 1708168 | 5.5% |
| a | 1671073 | 5.4% |
| r | 1554155 | 5.0% |
| s | 1249854 | 4.1% |
| c | 981043 | 3.2% |
| Other values (82) | 10344164 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 22179429 | |
| Space Separator | 4132071 | 13.4% |
| Uppercase Letter | 3027854 | 9.8% |
| Decimal Number | 712264 | 2.3% |
| Other Punctuation | 536260 | 1.7% |
| Open Punctuation | 103223 | 0.3% |
| Close Punctuation | 103221 | 0.3% |
| Math Symbol | 26815 | 0.1% |
| Dash Punctuation | 8726 | < 0.1% |
| Connector Punctuation | 335 | < 0.1% |
| Other values (3) | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 2608470 | |
| t | 2311910 | |
| o | 2139574 | |
| e | 2129723 | |
| n | 1708168 | 7.7% |
| a | 1671073 | 7.5% |
| r | 1554155 | 7.0% |
| s | 1249854 | 5.6% |
| c | 981043 | 4.4% |
| l | 809850 | 3.7% |
| Other values (16) | 5015609 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 475177 | |
| S | 312569 | |
| N | 284886 | |
| I | 260808 | |
| P | 248493 | |
| D | 239558 | |
| T | 217566 | 7.2% |
| E | 157599 | 5.2% |
| A | 134747 | 4.5% |
| O | 129263 | 4.3% |
| Other values (16) | 567188 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 253963 | |
| : | 134709 | |
| ; | 123326 | |
| , | 10668 | 2.0% |
| / | 5315 | 1.0% |
| & | 3632 | 0.7% |
| ? | 1748 | 0.3% |
| " | 1387 | 0.3% |
| # | 984 | 0.2% |
| ' | 412 | 0.1% |
| Other values (5) | 116 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 96673 | |
| 5 | 95617 | |
| 0 | 89759 | |
| 4 | 70754 | |
| 2 | 67002 | |
| 7 | 66254 | |
| 8 | 64489 | |
| 6 | 57819 | |
| 3 | 52279 | |
| 9 | 51618 |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 24725 | |
| + | 1585 | 5.9% |
| > | 212 | 0.8% |
| < | 199 | 0.7% |
| = | 94 | 0.4% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 103206 | |
| [ | 17 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 103204 | |
| ] | 17 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4132071 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8726 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 335 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 4 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 2 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 25207283 | |
| Common | 5622922 | 18.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 2608470 | 10.3% |
| t | 2311910 | 9.2% |
| o | 2139574 | 8.5% |
| e | 2129723 | 8.4% |
| n | 1708168 | 6.8% |
| a | 1671073 | 6.6% |
| r | 1554155 | 6.2% |
| s | 1249854 | 5.0% |
| c | 981043 | 3.9% |
| l | 809850 | 3.2% |
| Other values (42) | 8043463 |
Common
| Value | Count | Frequency (%) |
| (space) | 4132071 | |
| . | 253963 | 4.5% |
| : | 134709 | 2.4% |
| ; | 123326 | 2.2% |
| ( | 103206 | 1.8% |
| ) | 103204 | 1.8% |
| 1 | 96673 | 1.7% |
| 5 | 95617 | 1.7% |
| 0 | 89759 | 1.6% |
| 4 | 70754 | 1.3% |
| Other values (30) | 419640 | 7.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30830198 | |
| Punctuation | 7 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 4132071 | 13.4% |
| i | 2608470 | 8.5% |
| t | 2311910 | 7.5% |
| o | 2139574 | 6.9% |
| e | 2129723 | 6.9% |
| n | 1708168 | 5.5% |
| a | 1671073 | 5.4% |
| r | 1554155 | 5.0% |
| s | 1249854 | 4.1% |
| c | 981043 | 3.2% |
| Other values (79) | 10344157 |
Punctuation
| Value | Count | Frequency (%) |
| “ | 4 | |
| ” | 2 | |
| … | 1 | 14.3% |
fieldNumber
Text
Missing 
| Distinct | 1516 |
|---|---|
| Distinct (%) | 34.0% |
| Missing | 720044 |
| Missing (%) | 99.4% |
| Memory size | 5.5 MiB |
Length
| Max length | 209 |
|---|---|
| Median length | 45 |
| Mean length | 35.25537634 |
| Min length | 1 |
Unique
| Unique | 1229 |
|---|---|
| Unique (%) | 27.5% |
Sample
| 1st row | MTC-08009; MTC-08009B; MTC-08009B (A); MTC-08009B (B) |
|---|---|
| 2nd row | 217 |
| 3rd row | YP79-2 |
| 4th row | TDP31 |
| 5th row | 82-10; 82-19; 82-21; 82-22; 82-4; 82-6; 82-7 |
| Value | Count | Frequency (%) |
| 82-10 | 767 | 4.2% |
| 82-21 | 767 | 4.2% |
| 82-22 | 767 | 4.2% |
| 82-4 | 767 | 4.2% |
| 82-6 | 767 | 4.2% |
| 82-7 | 767 | 4.2% |
| 82-19 | 767 | 4.2% |
| mtc-04028dd | 329 | 1.8% |
| mtc-04028h | 329 | 1.8% |
| mtc-04028gg | 329 | 1.8% |
| Other values (1502) | 11759 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 18832 | |
| - | 15944 | |
| 2 | 14513 | |
| (space) | 13651 | 8.7% |
| ; | 12694 | 8.1% |
| 8 | 11928 | 7.6% |
| C | 9870 | 6.3% |
| M | 9201 | 5.8% |
| T | 8674 | 5.5% |
| 4 | 7381 | 4.7% |
| Other values (62) | 34692 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 72021 | |
| Uppercase Letter | 40992 | |
| Dash Punctuation | 15944 | 10.1% |
| Space Separator | 13651 | 8.7% |
| Other Punctuation | 12856 | 8.2% |
| Lowercase Letter | 1716 | 1.1% |
| Close Punctuation | 100 | 0.1% |
| Open Punctuation | 100 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 290 | |
| a | 205 | |
| m | 201 | |
| e | 185 | |
| l | 159 | |
| p | 150 | |
| o | 130 | |
| t | 77 | 4.5% |
| r | 70 | 4.1% |
| i | 55 | 3.2% |
| Other values (16) | 194 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 9870 | |
| M | 9201 | |
| T | 8674 | |
| A | 1535 | 3.7% |
| G | 1513 | 3.7% |
| B | 1509 | 3.7% |
| E | 1291 | 3.1% |
| D | 1285 | 3.1% |
| F | 1161 | 2.8% |
| H | 1137 | 2.8% |
| Other values (15) | 3816 | 9.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 18832 | |
| 2 | 14513 | |
| 8 | 11928 | |
| 4 | 7381 | 10.2% |
| 1 | 6730 | 9.3% |
| 3 | 3699 | 5.1% |
| 5 | 3595 | 5.0% |
| 7 | 2000 | 2.8% |
| 9 | 1780 | 2.5% |
| 6 | 1563 | 2.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 12694 | |
| . | 62 | 0.5% |
| , | 49 | 0.4% |
| # | 34 | 0.3% |
| / | 10 | 0.1% |
| & | 4 | < 0.1% |
| ' | 3 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15944 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 13651 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 100 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 100 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 114672 | |
| Latin | 42708 | 27.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 9870 | |
| M | 9201 | |
| T | 8674 | |
| A | 1535 | 3.6% |
| G | 1513 | 3.5% |
| B | 1509 | 3.5% |
| E | 1291 | 3.0% |
| D | 1285 | 3.0% |
| F | 1161 | 2.7% |
| H | 1137 | 2.7% |
| Other values (41) | 5532 |
Common
| Value | Count | Frequency (%) |
| 0 | 18832 | |
| - | 15944 | |
| 2 | 14513 | |
| (space) | 13651 | |
| ; | 12694 | |
| 8 | 11928 | |
| 4 | 7381 | 6.4% |
| 1 | 6730 | 5.9% |
| 3 | 3699 | 3.2% |
| 5 | 3595 | 3.1% |
| Other values (11) | 5705 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 157380 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 18832 | |
| - | 15944 | |
| 2 | 14513 | |
| (space) | 13651 | 8.7% |
| ; | 12694 | 8.1% |
| 8 | 11928 | 7.6% |
| C | 9870 | 6.3% |
| M | 9201 | 5.8% |
| T | 8674 | 5.5% |
| 4 | 7381 | 4.7% |
| Other values (62) | 34692 |
eventDate
Text
Missing 
| Distinct | 17205 |
|---|---|
| Distinct (%) | 6.9% |
| Missing | 474561 |
| Missing (%) | 65.5% |
| Memory size | 5.5 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 10 |
| Mean length | 7.503406722 |
| Min length | 4 |
Unique
| Unique | 5657 |
|---|---|
| Unique (%) | 2.3% |
Sample
| 1st row | 1985-01-23 |
|---|---|
| 2nd row | 1974 |
| 3rd row | 1980 |
| 4th row | 1963 |
| 5th row | 1956 |
| Value | Count | Frequency (%) |
| 1999 | 3773 | 1.5% |
| 1980 | 3743 | 1.5% |
| 1982 | 3572 | 1.4% |
| 1984-02 | 3350 | 1.3% |
| 1998 | 3320 | 1.3% |
| 1997 | 3308 | 1.3% |
| 1995 | 3121 | 1.2% |
| 2001 | 2935 | 1.2% |
| 1974 | 2850 | 1.1% |
| 1971 | 2519 | 1.0% |
| Other values (17195) | 217456 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 389845 | |
| 9 | 321056 | |
| - | 287687 | |
| 0 | 240421 | |
| 8 | 128673 | 6.9% |
| 7 | 115893 | 6.2% |
| 2 | 104754 | 5.6% |
| 6 | 84492 | 4.5% |
| 4 | 67249 | 3.6% |
| 5 | 66574 | 3.5% |
| Other values (2) | 68810 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1585246 | |
| Dash Punctuation | 287687 | 15.3% |
| Other Punctuation | 2521 | 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 389845 | |
| 9 | 321056 | |
| 0 | 240421 | |
| 8 | 128673 | 8.1% |
| 7 | 115893 | 7.3% |
| 2 | 104754 | 6.6% |
| 6 | 84492 | 5.3% |
| 4 | 67249 | 4.2% |
| 5 | 66574 | 4.2% |
| 3 | 66289 | 4.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 287687 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 2521 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1875454 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 389845 | |
| 9 | 321056 | |
| - | 287687 | |
| 0 | 240421 | |
| 8 | 128673 | 6.9% |
| 7 | 115893 | 6.2% |
| 2 | 104754 | 5.6% |
| 6 | 84492 | 4.5% |
| 4 | 67249 | 3.6% |
| 5 | 66574 | 3.5% |
| Other values (2) | 68810 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1875454 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 389845 | |
| 9 | 321056 | |
| - | 287687 | |
| 0 | 240421 | |
| 8 | 128673 | 6.9% |
| 7 | 115893 | 6.2% |
| 2 | 104754 | 5.6% |
| 6 | 84492 | 4.5% |
| 4 | 67249 | 3.6% |
| 5 | 66574 | 3.5% |
| Other values (2) | 68810 | 3.7% |
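The eventDate samples above mix full dates (`1985-01-23`) with reduced-precision values (`1974`, `1984-02`), and the 2521 `/` characters together with the maximum length of 21 suggest that some values are intervals of the form `YYYY-MM-DD/YYYY-MM-DD`. Only full dates can yield a day of year, which is one plausible reason the day-of-year variables are missing more often (82.0%) than eventDate itself (65.5%). The Python sketch below shows one way to derive a start and end day of year from such strings; the function name `day_of_year_range` and the parsing rules are assumptions for illustration, not the interpretation logic actually used to build this dataset.

```python
import re
from datetime import date

# Matches only a full calendar date such as 1985-01-23.
FULL_DATE = re.compile(r"\d{4}-\d{2}-\d{2}")

def _day_of_year(text):
    """Return the 1-based day of year for a full ISO date, else None."""
    if not FULL_DATE.fullmatch(text):
        return None  # reduced precision (year or year-month) or other text
    try:
        return date.fromisoformat(text).timetuple().tm_yday
    except ValueError:
        return None  # matches the pattern but is not a real calendar date

def day_of_year_range(event_date):
    """Map an eventDate string to (start_day, end_day); None where unknown."""
    start, _, end = event_date.strip().partition("/")
    start_day = _day_of_year(start)
    # A single date is treated as an interval of one day.
    end_day = _day_of_year(end) if end else start_day
    return start_day, end_day

print(day_of_year_range("1985-01-23"))
print(day_of_year_range("1974"))
print(day_of_year_range("1996-12-30/1996-12-31"))
```

Under these rules a year-only value such as `1974` produces no day of year at all, while an interval ending on 31 December of a leap year reaches the maximum of 366.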
startDayOfYear
Real number (ℝ)
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 593923 |
| Missing (%) | 82.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 192.2771605 |
| Minimum | 1 |
|---|---|
| Maximum | 366 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 48 |
| Q1 | 137 |
| median | 201 |
| Q3 | 248 |
| 95-th percentile | 310 |
| Maximum | 366 |
| Range | 365 |
| Interquartile range (IQR) | 111 |
Descriptive statistics
| Standard deviation | 78.76365518 |
|---|---|
| Coefficient of variation (CV) | 0.409636043 |
| Kurtosis | -0.5202115862 |
| Mean | 192.2771605 |
| Median Absolute Deviation (MAD) | 56 |
| Skewness | -0.3074105143 |
| Sum | 25108513 |
| Variance | 6203.713378 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 198 | 1192 | 0.2% |
| 191 | 1146 | 0.2% |
| 195 | 1139 | 0.2% |
| 223 | 1099 | 0.2% |
| 196 | 1078 | 0.1% |
| 194 | 1065 | 0.1% |
| 251 | 1041 | 0.1% |
| 138 | 995 | 0.1% |
| 137 | 971 | 0.1% |
| 136 | 949 | 0.1% |
| Other values (356) | 119910 | 16.6% |
| (Missing) | 593923 |
| Value | Count | Frequency (%) |
| 1 | 20 | < 0.1% |
| 2 | 59 | < 0.1% |
| 3 | 24 | < 0.1% |
| 4 | 125 | |
| 5 | 150 |
| Value | Count | Frequency (%) |
| 366 | 8 | < 0.1% |
| 365 | 19 | < 0.1% |
| 364 | 22 | < 0.1% |
| 363 | 29 | < 0.1% |
| 362 | 8 | < 0.1% |
endDayOfYear
Real number (ℝ)
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 593923 |
| Missing (%) | 82.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 192.4239844 |

| Minimum | 1 |
|---|---|
| Maximum | 366 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 49 |
| Q1 | 137 |
| median | 201 |
| Q3 | 248 |
| 95-th percentile | 310 |
| Maximum | 366 |
| Range | 365 |
| Interquartile range (IQR) | 111 |
Descriptive statistics
| Standard deviation | 78.66872144 |
|---|---|
| Coefficient of variation (CV) | 0.4088301242 |
| Kurtosis | -0.526737264 |
| Mean | 192.4239844 |
| Median Absolute Deviation (MAD) | 56 |
| Skewness | -0.3026076446 |
| Sum | 25127686 |
| Variance | 6188.767733 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 198 | 1191 | 0.2% |
| 191 | 1141 | 0.2% |
| 195 | 1132 | 0.2% |
| 196 | 1085 | 0.1% |
| 194 | 1066 | 0.1% |
| 251 | 1041 | 0.1% |
| 138 | 996 | 0.1% |
| 137 | 969 | 0.1% |
| 136 | 949 | 0.1% |
| 203 | 935 | 0.1% |
| Other values (356) | 120080 | 16.6% |
| (Missing) | 593923 | 82.0% |
| Value | Count | Frequency (%) |
| 1 | 20 | < 0.1% |
| 2 | 58 | < 0.1% |
| 3 | 24 | < 0.1% |
| 4 | 125 | < 0.1% |
| 5 | 150 | < 0.1% |
| Value | Count | Frequency (%) |
| 366 | 8 | < 0.1% |
| 365 | 19 | < 0.1% |
| 364 | 23 | < 0.1% |
| 363 | 27 | < 0.1% |
| 362 | 8 | < 0.1% |
year
Real number (ℝ)
Missing 
| Distinct | 190 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 474684 |
| Missing (%) | 65.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1960.539007 |

| Minimum | 1805 |
|---|---|
| Maximum | 2023 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 1805 |
|---|---|
| 5-th percentile | 1900 |
| Q1 | 1941 |
| median | 1970 |
| Q3 | 1982 |
| 95-th percentile | 1998 |
| Maximum | 2023 |
| Range | 218 |
| Interquartile range (IQR) | 41 |
Descriptive statistics
| Standard deviation | 30.37698185 |
|---|---|
| Coefficient of variation (CV) | 0.01549419916 |
| Kurtosis | 0.1001964871 |
| Mean | 1960.539007 |
| Median Absolute Deviation (MAD) | 16 |
| Skewness | -0.9196027756 |
| Sum | 489789697 |
| Variance | 922.7610263 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1980 | 7356 | 1.0% |
| 1981 | 7186 | 1.0% |
| 1982 | 7123 | 1.0% |
| 1976 | 6483 | 0.9% |
| 1971 | 6407 | 0.9% |
| 1973 | 5775 | 0.8% |
| 1984 | 5606 | 0.8% |
| 1974 | 5415 | 0.7% |
| 1999 | 5014 | 0.7% |
| 1987 | 4898 | 0.7% |
| Other values (180) | 188561 | 26.0% |
| (Missing) | 474684 | 65.5% |
| Value | Count | Frequency (%) |
| 1805 | 1 | < 0.1% |
| 1810 | 1 | < 0.1% |
| 1817 | 1 | < 0.1% |
| 1823 | 9 | < 0.1% |
| 1824 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2023 | 1 | < 0.1% |
| 2022 | 1 | < 0.1% |
| 2021 | 1 | < 0.1% |
| 2020 | 15 | < 0.1% |
| 2019 | 6 | < 0.1% |
month
Real number (ℝ)
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 572740 |
| Missing (%) | 79.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.718583628 |

| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 5 |
| median | 7 |
| Q3 | 9 |
| 95-th percentile | 11 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.66173107 |
|---|---|
| Coefficient of variation (CV) | 0.3961744346 |
| Kurtosis | -0.5834458529 |
| Mean | 6.718583628 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.2828535196 |
| Sum | 1019666 |
| Variance | 7.084812289 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 25644 | 3.5% |
| 7 | 25351 | 3.5% |
| 6 | 14941 | 2.1% |
| 5 | 14611 | 2.0% |
| 10 | 14469 | 2.0% |
| 9 | 14237 | 2.0% |
| 4 | 11303 | 1.6% |
| 2 | 8497 | 1.2% |
| 3 | 8211 | 1.1% |
| 11 | 6642 | 0.9% |
| Other values (2) | 7862 | 1.1% |
| (Missing) | 572740 | 79.1% |
| Value | Count | Frequency (%) |
| 1 | 4792 | 0.7% |
| 2 | 8497 | 1.2% |
| 3 | 8211 | 1.1% |
| 4 | 11303 | 1.6% |
| 5 | 14611 | 2.0% |
| Value | Count | Frequency (%) |
| 12 | 3070 | 0.4% |
| 11 | 6642 | 0.9% |
| 10 | 14469 | 2.0% |
| 9 | 14237 | 2.0% |
| 8 | 25644 | 3.5% |
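The month summary statistics can be reproduced from the frequency table alone. This is a small sketch, assuming the twelve counts listed in the common-value and extreme-value tables above are complete; the dictionary name is illustrative.

```python
# Month -> count, copied from the frequency tables above.
month_counts = {
    1: 4792, 2: 8497, 3: 8211, 4: 11303, 5: 14611, 6: 14941,
    7: 25351, 8: 25644, 9: 14237, 10: 14469, 11: 6642, 12: 3070,
}

n = sum(month_counts.values())                            # non-missing records
total = sum(month * count for month, count in month_counts.items())
mean = total / n

print(n, total, round(mean, 6))
```

The count should equal the dataset size minus the reported missing values (724508 - 572740), and the weighted sum and mean should agree with the "Sum" and "Mean" rows.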
day
Real number (ℝ)
Missing 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 596444 |
| Missing (%) | 82.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.82372876 |

| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 9 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 8.581155667 |
|---|---|
| Coefficient of variation (CV) | 0.542296686 |
| Kurtosis | -1.116069344 |
| Mean | 15.82372876 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | -0.00373108911 |
| Sum | 2026450 |
| Variance | 73.63623258 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 17 | 5139 | 0.7% |
| 16 | 4975 | 0.7% |
| 18 | 4960 | 0.7% |
| 13 | 4630 | 0.6% |
| 20 | 4577 | 0.6% |
| 23 | 4547 | 0.6% |
| 8 | 4524 | 0.6% |
| 14 | 4502 | 0.6% |
| 15 | 4418 | 0.6% |
| 10 | 4351 | 0.6% |
| Other values (21) | 81441 | 11.2% |
| (Missing) | 596444 | 82.3% |
| Value | Count | Frequency (%) |
| 1 | 3812 | 0.5% |
| 2 | 4062 | 0.6% |
| 3 | 3807 | 0.5% |
| 4 | 3694 | 0.5% |
| 5 | 3756 | 0.5% |
| Value | Count | Frequency (%) |
| 31 | 2241 | 0.3% |
| 30 | 3914 | 0.5% |
| 29 | 3746 | 0.5% |
| 28 | 4135 | 0.6% |
| 27 | 4079 | 0.6% |
Missing 
| Distinct | 17805 |
|---|---|
| Distinct (%) | 6.4% |
| Missing | 445814 |
| Missing (%) | 61.5% |
| Memory size | 5.5 MiB |
Length
| Max length | 61 |
|---|---|
| Median length | 11 |
| Mean length | 11.41229808 |
| Min length | 4 |
Unique
| Unique | 5871 |
|---|---|
| Unique (%) | 2.1% |
Sample
| 1st row | 23 JAN 1985 |
|---|---|
| 2nd row | April, 1928 |
| 3rd row | -- --- 1980 |
| 4th row | -- --- 1963 |
| 5th row | -- --- 1956 |
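The sample rows above mix fully specified dates, month-and-year strings, and placeholder-padded years. As a rough illustration of how the year might be recovered from such strings, here is a minimal sketch; the function name and the assumed year range of 1800 to 2099 are hypothetical choices, not part of the report.

```python
import re

# Matches a standalone four-digit year from 1800 to 2099 (an assumed range).
YEAR_RE = re.compile(r"\b(?:18|19|20)\d{2}\b")

def extract_year(verbatim):
    """Return the first four-digit year found in a verbatim date string, or None."""
    match = YEAR_RE.search(verbatim)
    return int(match.group(0)) if match else None

# Sample values copied from the table above.
samples = ["23 JAN 1985", "April, 1928", "-- --- 1980", "-- --- 1963", "-- --- 1956"]
years = [extract_year(s) for s in samples]
print(years)
```

Strings without a recognisable year, such as a bare season name, return `None` rather than raising an error.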
| Value | Count | Frequency (%) |
| | 235730 | 28.9% |
| aug | 23677 | 2.9% |
| jul | 22916 | 2.8% |
| summer | 20031 | 2.5% |
| jun | 14619 | 1.8% |
| may | 14325 | 1.8% |
| oct | 14287 | 1.7% |
| to | 13955 | 1.7% |
| sep | 13176 | 1.6% |
| apr | 10764 | 1.3% |
| Other values (1210) | 433163 | 53.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 633590 | 19.9% |
| (space) | 537949 | 16.9% |
| 1 | 382844 | 12.0% |
| 9 | 314473 | 9.9% |
| 8 | 105770 | 3.3% |
| 0 | 101858 | 3.2% |
| 7 | 96225 | 3.0% |
| 2 | 94879 | 3.0% |
| 6 | 69663 | 2.2% |
| A | 63864 | 2.0% |
| Other values (59) | 779424 | 24.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1340357 | 42.1% |
| Dash Punctuation | 633590 | 19.9% |
| Space Separator | 537949 | 16.9% |
| Uppercase Letter | 491521 | 15.5% |
| Lowercase Letter | 169648 | 5.3% |
| Other Punctuation | 6422 | 0.2% |
| Math Symbol | 1026 | < 0.1% |
| Open Punctuation | 13 | < 0.1% |
| Close Punctuation | 13 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 40530 | 23.9% |
| u | 32141 | 18.9% |
| e | 26707 | 15.7% |
| r | 24584 | 14.5% |
| t | 7049 | 4.2% |
| a | 5225 | 3.1% |
| l | 4565 | 2.7% |
| g | 3709 | 2.2% |
| n | 3604 | 2.1% |
| p | 3590 | 2.1% |
| Other values (13) | 17944 | 10.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 63864 | 13.0% |
| U | 61193 | 12.4% |
| J | 48266 | 9.8% |
| O | 36480 | 7.4% |
| S | 35414 | 7.2% |
| T | 28143 | 5.7% |
| N | 24509 | 5.0% |
| P | 23974 | 4.9% |
| E | 23721 | 4.8% |
| G | 23661 | 4.8% |
| Other values (11) | 122296 | 24.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 382844 | 28.6% |
| 9 | 314473 | 23.5% |
| 8 | 105770 | 7.9% |
| 0 | 101858 | 7.6% |
| 7 | 96225 | 7.2% |
| 2 | 94879 | 7.1% |
| 6 | 69663 | 5.2% |
| 3 | 60386 | 4.5% |
| 4 | 58552 | 4.4% |
| 5 | 55707 | 4.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3733 | 58.1% |
| . | 1309 | 20.4% |
| ' | 650 | 10.1% |
| / | 634 | 9.9% |
| ? | 92 | 1.4% |
| ; | 2 | < 0.1% |
| & | 1 | < 0.1% |
| * | 1 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 1017 | 99.1% |
| + | 5 | 0.5% |
| ~ | 4 | 0.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 633590 | 100.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 537949 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 13 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 13 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2519370 | 79.2% |
| Latin | 661169 | 20.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 63864 | 9.7% |
| U | 61193 | 9.3% |
| J | 48266 | 7.3% |
| m | 40530 | 6.1% |
| O | 36480 | 5.5% |
| S | 35414 | 5.4% |
| u | 32141 | 4.9% |
| T | 28143 | 4.3% |
| e | 26707 | 4.0% |
| r | 24584 | 3.7% |
| Other values (34) | 263847 | 39.9% |
Common
| Value | Count | Frequency (%) |
| - | 633590 | 25.1% |
| (space) | 537949 | 21.4% |
| 1 | 382844 | 15.2% |
| 9 | 314473 | 12.5% |
| 8 | 105770 | 4.2% |
| 0 | 101858 | 4.0% |
| 7 | 96225 | 3.8% |
| 2 | 94879 | 3.8% |
| 6 | 69663 | 2.8% |
| 3 | 60386 | 2.4% |
| Other values (15) | 121733 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3180539 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 633590 | 19.9% |
| (space) | 537949 | 16.9% |
| 1 | 382844 | 12.0% |
| 9 | 314473 | 9.9% |
| 8 | 105770 | 3.3% |
| 0 | 101858 | 3.2% |
| 7 | 96225 | 3.0% |
| 2 | 94879 | 3.0% |
| 6 | 69663 | 2.2% |
| A | 63864 | 2.0% |
| Other values (59) | 779424 | 24.5% |
locationID
Text
Missing 
| Distinct | 66560 |
|---|---|
| Distinct (%) | 17.1% |
| Missing | 335037 |
| Missing (%) | 46.2% |
| Memory size | 5.5 MiB |
Length
| Max length | 61 |
|---|---|
| Median length | 59 |
| Mean length | 5.757204002 |
| Min length | 1 |
Unique
| Unique | 40451 |
|---|---|
| Unique (%) | 10.4% |
Sample
| 1st row | 1612 |
|---|---|
| 2nd row | 06 |
| 3rd row | USGS LOC M533 |
| 4th row | 42246 |
| 5th row | 707A |
| Value | Count | Frequency (%) |
| 42246 | 30863 | 6.4% |
| 35k | 30551 | 6.3% |
| loc | 19929 | 4.1% |
| sta | 7656 | 1.6% |
| d | 5640 | 1.2% |
| site | 4020 | 0.8% |
| 40193 | 3269 | 0.7% |
| leg | 3132 | 0.7% |
| olson | 2904 | 0.6% |
| 41142 | 2897 | 0.6% |
| Other values (59519) | 370823 | 77.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 252324 | 11.3% |
| 1 | 209625 | 9.3% |
| 4 | 194523 | 8.7% |
| 3 | 152357 | 6.8% |
| 0 | 140257 | 6.3% |
| 5 | 136706 | 6.1% |
| 6 | 130433 | 5.8% |
| 7 | 107242 | 4.8% |
| 8 | 99787 | 4.5% |
| 9 | 93127 | 4.2% |
| Other values (71) | 725883 | 32.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1516381 | 67.6% |
| Uppercase Letter | 531863 | 23.7% |
| Space Separator | 92213 | 4.1% |
| Dash Punctuation | 52032 | 2.3% |
| Other Punctuation | 28932 | 1.3% |
| Lowercase Letter | 15132 | 0.7% |
| Math Symbol | 3062 | 0.1% |
| Close Punctuation | 1336 | 0.1% |
| Open Punctuation | 1313 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 51448 | 9.7% |
| L | 50984 | 9.6% |
| C | 46019 | 8.7% |
| S | 44241 | 8.3% |
| A | 41228 | 7.8% |
| E | 37168 | 7.0% |
| K | 36506 | 6.9% |
| T | 30011 | 5.6% |
| I | 25951 | 4.9% |
| N | 20969 | 3.9% |
| Other values (16) | 147338 | 27.7% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2360 | 15.6% |
| a | 1816 | 12.0% |
| g | 1802 | 11.9% |
| t | 1447 | 9.6% |
| o | 1201 | 7.9% |
| c | 1136 | 7.5% |
| i | 1026 | 6.8% |
| s | 789 | 5.2% |
| b | 707 | 4.7% |
| n | 562 | 3.7% |
| Other values (16) | 2286 | 15.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 13863 | 47.9% |
| , | 10529 | 36.4% |
| * | 2055 | 7.1% |
| / | 1776 | 6.1% |
| ' | 442 | 1.5% |
| # | 178 | 0.6% |
| ; | 41 | 0.1% |
| ? | 34 | 0.1% |
| : | 7 | < 0.1% |
| " | 6 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 252324 | 16.6% |
| 1 | 209625 | 13.8% |
| 4 | 194523 | 12.8% |
| 3 | 152357 | 10.0% |
| 0 | 140257 | 9.2% |
| 5 | 136706 | 9.0% |
| 6 | 130433 | 8.6% |
| 7 | 107242 | 7.1% |
| 8 | 99787 | 6.6% |
| 9 | 93127 | 6.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 3039 | 99.2% |
| = | 23 | 0.8% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1335 | 99.9% |
| ] | 1 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1304 | 99.3% |
| [ | 9 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 92213 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 52032 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1695269 | 75.6% |
| Latin | 546995 | 24.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| O | 51448 | 9.4% |
| L | 50984 | 9.3% |
| C | 46019 | 8.4% |
| S | 44241 | 8.1% |
| A | 41228 | 7.5% |
| E | 37168 | 6.8% |
| K | 36506 | 6.7% |
| T | 30011 | 5.5% |
| I | 25951 | 4.7% |
| N | 20969 | 3.8% |
| Other values (42) | 162470 | 29.7% |
Common
| Value | Count | Frequency (%) |
| 2 | 252324 | 14.9% |
| 1 | 209625 | 12.4% |
| 4 | 194523 | 11.5% |
| 3 | 152357 | 9.0% |
| 0 | 140257 | 8.3% |
| 5 | 136706 | 8.1% |
| 6 | 130433 | 7.7% |
| 7 | 107242 | 6.3% |
| 8 | 99787 | 5.9% |
| 9 | 93127 | 5.5% |
| Other values (19) | 178888 | 10.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2242264 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 252324 | 11.3% |
| 1 | 209625 | 9.3% |
| 4 | 194523 | 8.7% |
| 3 | 152357 | 6.8% |
| 0 | 140257 | 6.3% |
| 5 | 136706 | 6.1% |
| 6 | 130433 | 5.8% |
| 7 | 107242 | 4.8% |
| 8 | 99787 | 4.5% |
| 9 | 93127 | 4.2% |
| Other values (71) | 725883 | 32.4% |
higherGeography
Text
Missing 
| Distinct | 4708 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 148417 |
| Missing (%) | 20.5% |
| Memory size | 5.5 MiB |
Length
| Max length | 111 |
|---|---|
| Median length | 97 |
| Mean length | 42.17362361 |
| Min length | 4 |
Unique
| Unique | 1213 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | North America, United States, Florida |
|---|---|
| 2nd row | Africa, Kenya, Marsabit |
| 3rd row | North America, United States, Nevada, Pershing County |
| 4th row | Cuba, Camaguey Prov |
| 5th row | North America, United States, North Carolina, Beaufort County |
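The sample values suggest that higherGeography is a comma-separated list running from the broadest to the narrowest region, although not every row starts at the continent level. A minimal sketch of splitting such values, with a hypothetical helper name, could look like this:

```python
def split_geography(value):
    """Split a comma-separated higherGeography string into trimmed, non-empty parts."""
    return [part.strip() for part in value.split(",") if part.strip()]

# Sample values copied from the table above.
samples = [
    "North America, United States, Florida",
    "Africa, Kenya, Marsabit",
    "North America, United States, Nevada, Pershing County",
    "Cuba, Camaguey Prov",
    "North America, United States, North Carolina, Beaufort County",
]

for sample in samples:
    print(split_geography(sample))
```

Because the depth varies between rows (two parts for the Cuba sample, four for the county-level samples), the position of a part alone does not reliably identify its rank.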
| Value | Count | Frequency (%) |
| north | 537307 | 16.4% |
| america | 480121 | 14.7% |
| united | 421781 | 12.9% |
| states | 421705 | 12.9% |
| county | 259124 | 7.9% |
| carolina | 46843 | 1.4% |
| canada | 38942 | 1.2% |
| texas | 38273 | 1.2% |
| colorado | 35917 | 1.1% |
| beaufort | 33680 | 1.0% |
| Other values (2951) | 959718 | 29.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 2697320 | 11.1% |
| t | 2343978 | 9.6% |
| a | 2051368 | 8.4% |
| e | 1823223 | 7.5% |
| i | 1571709 | 6.5% |
| r | 1497295 | 6.2% |
| o | 1387848 | 5.7% |
| , | 1279367 | 5.3% |
| n | 1260166 | 5.2% |
| s | 766919 | 3.2% |
| Other values (58) | 7616652 | 31.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17040948 | 70.1% |
| Uppercase Letter | 3272221 | 13.5% |
| Space Separator | 2697320 | 11.1% |
| Other Punctuation | 1284183 | 5.3% |
| Dash Punctuation | 1169 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2343978 | 13.8% |
| a | 2051368 | 12.0% |
| e | 1823223 | 10.7% |
| i | 1571709 | 9.2% |
| r | 1497295 | 8.8% |
| o | 1387848 | 8.1% |
| n | 1260166 | 7.4% |
| s | 766919 | 4.5% |
| h | 662498 | 3.9% |
| c | 650930 | 3.8% |
| Other values (24) | 3025014 | 17.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 590551 | 18.0% |
| A | 571156 | 17.5% |
| C | 498307 | 15.2% |
| S | 484309 | 14.8% |
| U | 430602 | 13.2% |
| B | 108340 | 3.3% |
| M | 87750 | 2.7% |
| O | 60025 | 1.8% |
| T | 59534 | 1.8% |
| P | 52139 | 1.6% |
| Other values (16) | 329508 | 10.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1279367 | 99.6% |
| . | 3038 | 0.2% |
| ' | 1757 | 0.1% |
| ? | 21 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2697320 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1169 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20313169 | 83.6% |
| Common | 3982676 | 16.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 2343978 | 11.5% |
| a | 2051368 | 10.1% |
| e | 1823223 | 9.0% |
| i | 1571709 | 7.7% |
| r | 1497295 | 7.4% |
| o | 1387848 | 6.8% |
| n | 1260166 | 6.2% |
| s | 766919 | 3.8% |
| h | 662498 | 3.3% |
| c | 650930 | 3.2% |
| Other values (50) | 6297235 | 31.0% |
Common
| Value | Count | Frequency (%) |
| (space) | 2697320 | 67.7% |
| , | 1279367 | 32.1% |
| . | 3038 | 0.1% |
| ' | 1757 | < 0.1% |
| - | 1169 | < 0.1% |
| ? | 21 | < 0.1% |
| ( | 2 | < 0.1% |
| ) | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24288672 | > 99.9% |
| None | 7173 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 2697320 | 11.1% |
| t | 2343978 | 9.7% |
| a | 2051368 | 8.4% |
| e | 1823223 | 7.5% |
| i | 1571709 | 6.5% |
| r | 1497295 | 6.2% |
| o | 1387848 | 5.7% |
| , | 1279367 | 5.3% |
| n | 1260166 | 5.2% |
| s | 766919 | 3.2% |
| Other values (50) | 7609479 | 31.3% |
None
| Value | Count | Frequency (%) |
| ó | 3473 | 48.4% |
| í | 2116 | 29.5% |
| á | 1037 | 14.5% |
| é | 539 | 7.5% |
| ñ | 4 | 0.1% |
| è | 2 | < 0.1% |
| ä | 1 | < 0.1% |
| ú | 1 | < 0.1% |
continent
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 195168 |
| Missing (%) | 26.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 12.51518684 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | AFRICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 480938 | 90.9% |
| south_america | 11223 | 2.1% |
| europe | 9975 | 1.9% |
| asia | 9042 | 1.7% |
| oceania | 8130 | 1.5% |
| africa | 6638 | 1.3% |
| antarctica | 3394 | 0.6% |
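The character-level tables for continent follow directly from the value counts above. The sketch below reconstructs the totals, assuming the values are stored in the uppercase, underscore-separated form shown in the Sample table; the dictionary name is illustrative.

```python
# Continent value -> count, copied from the frequency table above.
continent_counts = {
    "NORTH_AMERICA": 480938,
    "SOUTH_AMERICA": 11223,
    "EUROPE": 9975,
    "ASIA": 9042,
    "OCEANIA": 8130,
    "AFRICA": 6638,
    "ANTARCTICA": 3394,
}

# Number of non-missing records.
n_values = sum(continent_counts.values())
# Total characters across all values (the ASCII block total).
n_chars = sum(len(name) * count for name, count in continent_counts.items())
# Total underscores (the Connector Punctuation total).
n_underscores = sum(name.count("_") * count for name, count in continent_counts.items())

print(n_values, n_chars, n_underscores)
```

The three totals should match the dataset size minus the missing count (724508 - 195168), the ASCII block count, and the Connector Punctuation count respectively.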
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1042124 | 15.7% |
| R | 993106 | 15.0% |
| E | 520241 | 7.9% |
| I | 519365 | 7.8% |
| C | 513717 | 7.8% |
| O | 510266 | 7.7% |
| T | 498949 | 7.5% |
| N | 492462 | 7.4% |
| H | 492161 | 7.4% |
| _ | 492161 | 7.4% |
| Other values (5) | 550237 | 8.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6132628 | 92.6% |
| Connector Punctuation | 492161 | 7.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1042124 | 17.0% |
| R | 993106 | 16.2% |
| E | 520241 | 8.5% |
| I | 519365 | 8.5% |
| C | 513717 | 8.4% |
| O | 510266 | 8.3% |
| T | 498949 | 8.1% |
| N | 492462 | 8.0% |
| H | 492161 | 8.0% |
| M | 492161 | 8.0% |
| Other values (4) | 58076 | 0.9% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 492161 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6132628 | 92.6% |
| Common | 492161 | 7.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1042124 | 17.0% |
| R | 993106 | 16.2% |
| E | 520241 | 8.5% |
| I | 519365 | 8.5% |
| C | 513717 | 8.4% |
| O | 510266 | 8.3% |
| T | 498949 | 8.1% |
| N | 492462 | 8.0% |
| H | 492161 | 8.0% |
| M | 492161 | 8.0% |
| Other values (4) | 58076 | 0.9% |
Common
| Value | Count | Frequency (%) |
| _ | 492161 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6624789 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 1042124 | 15.7% |
| R | 993106 | 15.0% |
| E | 520241 | 7.9% |
| I | 519365 | 7.8% |
| C | 513717 | 7.8% |
| O | 510266 | 7.7% |
| T | 498949 | 7.5% |
| N | 492462 | 7.4% |
| H | 492161 | 7.4% |
| _ | 492161 | 7.4% |
| Other values (5) | 550237 | 8.3% |
waterBody
Text
Missing 
| Distinct | 172 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 696851 |
| Missing (%) | 96.2% |
| Memory size | 5.5 MiB |
Length
| Max length | 61 |
|---|---|
| Median length | 54 |
| Mean length | 21.95758759 |
| Min length | 8 |
Unique
| Unique | 58 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | North Atlantic Ocean |
|---|---|
| 2nd row | North Pacific Ocean |
| 3rd row | North Atlantic Ocean, Caribbean Sea |
| 4th row | North Atlantic Ocean |
| 5th row | North Atlantic Ocean |
| Value | Count | Frequency (%) |
| ocean | 26667 | 28.1% |
| north | 18835 | 19.9% |
| atlantic | 13621 | 14.4% |
| pacific | 8356 | 8.8% |
| sea | 5778 | 6.1% |
| indian | 4034 | 4.3% |
| south | 2993 | 3.2% |
| timor | 2479 | 2.6% |
| of | 2181 | 2.3% |
| gulf | 2067 | 2.2% |
| Other values (146) | 7758 | 8.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 67112 | 11.1% |
| a | 66029 | 10.9% |
| c | 60399 | 9.9% |
| n | 52729 | 8.7% |
| t | 51240 | 8.4% |
| i | 42959 | 7.1% |
| e | 39252 | 6.5% |
| o | 28732 | 4.7% |
| O | 27050 | 4.5% |
| r | 26329 | 4.3% |
| Other values (39) | 145450 | 24.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 439588 | 72.4% |
| Uppercase Letter | 92948 | 15.3% |
| Space Separator | 67112 | 11.1% |
| Other Punctuation | 7633 | 1.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 66029 | 15.0% |
| c | 60399 | 13.7% |
| n | 52729 | 12.0% |
| t | 51240 | 11.7% |
| i | 42959 | 9.8% |
| e | 39252 | 8.9% |
| o | 28732 | 6.5% |
| r | 26329 | 6.0% |
| h | 22202 | 5.1% |
| l | 16619 | 3.8% |
| Other values (15) | 33098 | 7.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 27050 | 29.1% |
| N | 18947 | 20.4% |
| A | 14632 | 15.7% |
| S | 9530 | 10.3% |
| P | 8558 | 9.2% |
| I | 4100 | 4.4% |
| M | 2579 | 2.8% |
| T | 2567 | 2.8% |
| G | 2317 | 2.5% |
| C | 1788 | 1.9% |
| Other values (12) | 880 | 0.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 67112 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 7633 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 532536 | 87.7% |
| Common | 74745 | 12.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 66029 | 12.4% |
| c | 60399 | 11.3% |
| n | 52729 | 9.9% |
| t | 51240 | 9.6% |
| i | 42959 | 8.1% |
| e | 39252 | 7.4% |
| o | 28732 | 5.4% |
| O | 27050 | 5.1% |
| r | 26329 | 4.9% |
| h | 22202 | 4.2% |
| Other values (37) | 115615 | 21.7% |
Common
| Value | Count | Frequency (%) |
| (space) | 67112 | 89.8% |
| , | 7633 | 10.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 607281 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 67112 | 11.1% |
| a | 66029 | 10.9% |
| c | 60399 | 9.9% |
| n | 52729 | 8.7% |
| t | 51240 | 8.4% |
| i | 42959 | 7.1% |
| e | 39252 | 6.5% |
| o | 28732 | 4.7% |
| O | 27050 | 4.5% |
| r | 26329 | 4.3% |
| Other values (39) | 145450 | 24.0% |
islandGroup
Text
Missing 
| Distinct | 33 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 723710 |
| Missing (%) | 99.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 24 |
| Mean length | 16.78571429 |
| Min length | 5 |
Unique
| Unique | 13 |
|---|---|
| Unique (%) | 1.6% |
Sample
| 1st row | Mariana Islands |
|---|---|
| 2nd row | Northern Mariana Islands |
| 3rd row | Gilbert Islands |
| 4th row | Gilbert Islands |
| 5th row | Aleutian Islands |
| Value | Count | Frequency (%) |
| islands | 765 | 44.5% |
| marshall | 241 | 14.0% |
| mariana | 155 | 9.0% |
| gilbert | 135 | 7.9% |
| northern | 134 | 7.8% |
| marianas | 120 | 7.0% |
| solomon | 21 | 1.2% |
| ryukyu | 18 | 1.0% |
| hawaiian | 18 | 1.0% |
| antilles | 15 | 0.9% |
| Other values (26) | 97 | 5.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2202 | 16.4% |
| s | 1936 | 14.5% |
| l | 1461 | 10.9% |
| n | 1270 | 9.5% |
| r | 960 | 7.2% |
| (space) | 921 | 6.9% |
| d | 800 | 6.0% |
| I | 765 | 5.7% |
| M | 527 | 3.9% |
| i | 498 | 3.7% |
| Other values (36) | 2055 | 15.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10752 | 80.3% |
| Uppercase Letter | 1720 | 12.8% |
| Space Separator | 921 | 6.9% |
| Other Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2202 | 20.5% |
| s | 1936 | 18.0% |
| l | 1461 | 13.6% |
| n | 1270 | 11.8% |
| r | 960 | 8.9% |
| d | 800 | 7.4% |
| i | 498 | 4.6% |
| h | 376 | 3.5% |
| e | 374 | 3.5% |
| t | 298 | 2.8% |
| Other values (13) | 577 | 5.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 765 | 44.5% |
| M | 527 | 30.6% |
| N | 140 | 8.1% |
| G | 135 | 7.8% |
| A | 25 | 1.5% |
| L | 24 | 1.4% |
| S | 24 | 1.4% |
| H | 18 | 1.0% |
| R | 18 | 1.0% |
| C | 11 | 0.6% |
| Other values (11) | 33 | 1.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 921 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12472 | 93.1% |
| Common | 923 | 6.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2202 | 17.7% |
| s | 1936 | 15.5% |
| l | 1461 | 11.7% |
| n | 1270 | 10.2% |
| r | 960 | 7.7% |
| d | 800 | 6.4% |
| I | 765 | 6.1% |
| M | 527 | 4.2% |
| i | 498 | 4.0% |
| h | 376 | 3.0% |
| Other values (34) | 1677 | 13.4% |
Common
| Value | Count | Frequency (%) |
| (space) | 921 | 99.8% |
| . | 2 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13395 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2202 | 16.4% |
| s | 1936 | 14.5% |
| l | 1461 | 10.9% |
| n | 1270 | 9.5% |
| r | 960 | 7.2% |
| (space) | 921 | 6.9% |
| d | 800 | 6.0% |
| I | 765 | 5.7% |
| M | 527 | 3.9% |
| i | 498 | 3.7% |
| Other values (36) | 2055 | 15.3% |
island
Text
Missing 
| Distinct | 87 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 714401 |
| Missing (%) | 98.6% |
| Memory size | 5.5 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 4 |
| Mean length | 6.015335906 |
| Min length | 3 |
Unique
| Unique | 38 |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Oahu |
|---|---|
| 2nd row | Oahu |
| 3rd row | Oahu |
| 4th row | Animasola Island |
| 5th row | Molokai |
| Value | Count | Frequency (%) |
| oahu | 5926 | 51.1% |
| molokai | 2218 | 19.1% |
| saint | 944 | 8.1% |
| helena | 938 | 8.1% |
| atoll | 241 | 2.1% |
| saipan | 132 | 1.1% |
| guam | 129 | 1.1% |
| onotoa | 116 | 1.0% |
| martha's | 108 | 0.9% |
| vineyard | 108 | 0.9% |
| Other values (91) | 728 | 6.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 11360 | 18.7% |
| u | 6232 | 10.3% |
| h | 6099 | 10.0% |
| O | 6043 | 9.9% |
| o | 5165 | 8.5% |
| i | 4062 | 6.7% |
| l | 3813 | 6.3% |
| n | 2689 | 4.4% |
| k | 2476 | 4.1% |
| M | 2342 | 3.9% |
| Other values (40) | 10516 | 17.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 47612 | 78.3% |
| Uppercase Letter | 11591 | 19.1% |
| Space Separator | 1481 | 2.4% |
| Other Punctuation | 109 | 0.2% |
| Dash Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 11360 | 23.9% |
| u | 6232 | 13.1% |
| h | 6099 | 12.8% |
| o | 5165 | 10.8% |
| i | 4062 | 8.5% |
| l | 3813 | 8.0% |
| n | 2689 | 5.6% |
| k | 2476 | 5.2% |
| e | 2309 | 4.8% |
| t | 1709 | 3.6% |
| Other values (16) | 1698 | 3.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 6043 | 52.1% |
| M | 2342 | 20.2% |
| S | 1177 | 10.2% |
| H | 941 | 8.1% |
| A | 273 | 2.4% |
| G | 140 | 1.2% |
| B | 138 | 1.2% |
| E | 125 | 1.1% |
| V | 121 | 1.0% |
| I | 89 | 0.8% |
| Other values (11) | 202 | 1.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1481 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 109 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 59203 | 97.4% |
| Common | 1594 | 2.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 11360 | 19.2% |
| u | 6232 | 10.5% |
| h | 6099 | 10.3% |
| O | 6043 | 10.2% |
| o | 5165 | 8.7% |
| i | 4062 | 6.9% |
| l | 3813 | 6.4% |
| n | 2689 | 4.5% |
| k | 2476 | 4.2% |
| M | 2342 | 4.0% |
| Other values (37) | 8922 | 15.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 1481 | 92.9% |
| ' | 109 | 6.8% |
| - | 4 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 60794 | > 99.9% |
| None | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 11360 | 18.7% |
| u | 6232 | 10.3% |
| h | 6099 | 10.0% |
| O | 6043 | 9.9% |
| o | 5165 | 8.5% |
| i | 4062 | 6.7% |
| l | 3813 | 6.3% |
| n | 2689 | 4.4% |
| k | 2476 | 4.1% |
| M | 2342 | 3.9% |
| Other values (38) | 10513 | 17.3% |
None
| Value | Count | Frequency (%) |
| ñ | 2 | 66.7% |
| é | 1 | 33.3% |
countryCode
Text
Missing 
| Distinct | 185 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 158422 |
| Missing (%) | 21.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 16 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | US |
|---|---|
| 2nd row | KE |
| 3rd row | US |
| 4th row | CU |
| 5th row | US |
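Every non-missing countryCode has exactly two characters (minimum, median, mean, and maximum length are all 2), which fits a two-letter country code. A short sketch, using only figures reported in this section plus the overall row count from the report overview, shows how the character totals and the share of US records follow from that; the variable names are illustrative.

```python
n_rows = 724508        # total observations (report overview)
n_missing = 158422     # missing countryCode values
code_length = 2        # every value is two characters long
us_count = 428942      # count of "us" in the frequency table

n_codes = n_rows - n_missing          # non-missing codes
n_chars = n_codes * code_length       # expected total characters
us_share = round(100 * us_count / n_codes, 1)

print(n_codes, n_chars, us_share)
```

The expected character total should equal the ASCII block count reported for this variable (1132172).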
| Value | Count | Frequency (%) |
| us | 428942 | 75.8% |
| ca | 39076 | 6.9% |
| pa | 8629 | 1.5% |
| do | 6290 | 1.1% |
| mx | 3952 | 0.7% |
| co | 3623 | 0.6% |
| fr | 3541 | 0.6% |
| aq | 3460 | 0.6% |
| cr | 3282 | 0.6% |
| pr | 3114 | 0.6% |
| Other values (175) | 62177 | 11.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 434999 | 38.4% |
| S | 434979 | 38.4% |
| A | 57869 | 5.1% |
| C | 53653 | 4.7% |
| P | 19200 | 1.7% |
| E | 14200 | 1.3% |
| R | 12973 | 1.1% |
| O | 11631 | 1.0% |
| D | 10040 | 0.9% |
| M | 9508 | 0.8% |
| Other values (16) | 73120 | 6.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1132172 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 434999 | 38.4% |
| S | 434979 | 38.4% |
| A | 57869 | 5.1% |
| C | 53653 | 4.7% |
| P | 19200 | 1.7% |
| E | 14200 | 1.3% |
| R | 12973 | 1.1% |
| O | 11631 | 1.0% |
| D | 10040 | 0.9% |
| M | 9508 | 0.8% |
| Other values (16) | 73120 | 6.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1132172 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 434999 | 38.4% |
| S | 434979 | 38.4% |
| A | 57869 | 5.1% |
| C | 53653 | 4.7% |
| P | 19200 | 1.7% |
| E | 14200 | 1.3% |
| R | 12973 | 1.1% |
| O | 11631 | 1.0% |
| D | 10040 | 0.9% |
| M | 9508 | 0.8% |
| Other values (16) | 73120 | 6.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1132172 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 434999 | 38.4% |
| S | 434979 | 38.4% |
| A | 57869 | 5.1% |
| C | 53653 | 4.7% |
| P | 19200 | 1.7% |
| E | 14200 | 1.3% |
| R | 12973 | 1.1% |
| O | 11631 | 1.0% |
| D | 10040 | 0.9% |
| M | 9508 | 0.8% |
| Other values (16) | 73120 | 6.5% |
stateProvince
Text
Missing 
| Distinct | 892 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 226462 |
| Missing (%) | 31.3% |
| Memory size | 5.5 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 23 |
| Mean length | 8.789222281 |
| Min length | 3 |
Unique
| Unique | 236 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Florida |
|---|---|
| 2nd row | Marsabit |
| 3rd row | Nevada |
| 4th row | Camaguey Prov |
| 5th row | North Carolina |
| Value | Count | Frequency (%) |
| carolina | 46813 | 7.5% |
| north | 45129 | 7.2% |
| texas | 38253 | 6.1% |
| colorado | 35917 | 5.8% |
| california | 32474 | 5.2% |
| columbia | 32203 | 5.2% |
| british | 32085 | 5.1% |
| alaska | 28545 | 4.6% |
| new | 23155 | 3.7% |
| wyoming | 22778 | 3.6% |
| Other values (878) | 287106 | 46.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 622536 | 14.2% |
| i | 445132 | 10.2% |
| o | 412678 | 9.4% |
| r | 299951 | 6.9% |
| n | 262321 | 6.0% |
| l | 249350 | 5.7% |
| s | 213346 | 4.9% |
| e | 190372 | 4.3% |
| C | 155417 | 3.6% |
| t | 143584 | 3.3% |
| Other values (54) | 1382750 | 31.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3624857 | 82.8% |
| Uppercase Letter | 625183 | 14.3% |
| Space Separator | 126412 | 2.9% |
| Dash Punctuation | 508 | < 0.1% |
| Other Punctuation | 475 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 622536 | 17.2% |
| i | 445132 | 12.3% |
| o | 412678 | 11.4% |
| r | 299951 | 8.3% |
| n | 262321 | 7.2% |
| l | 249350 | 6.9% |
| s | 213346 | 5.9% |
| e | 190372 | 5.3% |
| t | 143584 | 4.0% |
| h | 114639 | 3.2% |
| Other values (22) | 670948 | 18.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 155417 | 24.9% |
| N | 87902 | 14.1% |
| M | 48444 | 7.7% |
| T | 47635 | 7.6% |
| A | 45155 | 7.2% |
| B | 36744 | 5.9% |
| W | 32086 | 5.1% |
| H | 20814 | 3.3% |
| O | 19325 | 3.1% |
| I | 17859 | 2.9% |
| Other values (16) | 113802 | 18.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 425 | |
| ' | 50 | 10.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 126412 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 508 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4250040 | |
| Common | 127397 | 2.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 622536 | |
| i | 445132 | 10.5% |
| o | 412678 | 9.7% |
| r | 299951 | 7.1% |
| n | 262321 | 6.2% |
| l | 249350 | 5.9% |
| s | 213346 | 5.0% |
| e | 190372 | 4.5% |
| C | 155417 | 3.7% |
| t | 143584 | 3.4% |
| Other values (48) | 1255353 |
Common
| Value | Count | Frequency (%) |
| (space) | 126412 | |
| - | 508 | 0.4% |
| . | 425 | 0.3% |
| ' | 50 | < 0.1% |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4371514 | |
| None | 5923 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 622536 | |
| i | 445132 | 10.2% |
| o | 412678 | 9.4% |
| r | 299951 | 6.9% |
| n | 262321 | 6.0% |
| l | 249350 | 5.7% |
| s | 213346 | 4.9% |
| e | 190372 | 4.4% |
| C | 155417 | 3.6% |
| t | 143584 | 3.3% |
| Other values (48) | 1376827 |
None
| Value | Count | Frequency (%) |
| ó | 2622 | |
| í | 1945 | |
| á | 1034 | 17.5% |
| é | 319 | 5.4% |
| è | 2 | < 0.1% |
| ñ | 1 | < 0.1% |
county
Text
Missing 
| Distinct | 1997 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 454433 |
| Missing (%) | 62.7% |
| Memory size | 5.5 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 29 |
| Mean length | 14.2528779 |
| Min length | 3 |
Unique
| Unique | 393 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Pershing County |
|---|---|
| 2nd row | Beaufort County |
| 3rd row | Brewster County |
| 4th row | Los Angeles County |
| 5th row | Honolulu County |
| Value | Count | Frequency (%) |
| county | 259124 | |
| beaufort | 33592 | 5.9% |
| brewster | 15677 | 2.8% |
| maui | 10401 | 1.8% |
| los | 8883 | 1.6% |
| angeles | 8865 | 1.6% |
| honolulu | 5926 | 1.0% |
| san | 4953 | 0.9% |
| lincoln | 4346 | 0.8% |
| culberson | 4132 | 0.7% |
| Other values (1945) | 212334 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 423340 | |
| n | 401510 | |
| t | 375302 | |
| u | 352655 | |
| (space) | 298158 | 7.7% |
| C | 289740 | 7.5% |
| y | 279783 | 7.3% |
| e | 215178 | 5.6% |
| a | 186491 | 4.8% |
| r | 177010 | 4.6% |
| Other values (55) | 850179 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2976107 | |
| Uppercase Letter | 570194 | 14.8% |
| Space Separator | 298158 | 7.7% |
| Other Punctuation | 4230 | 0.1% |
| Dash Punctuation | 657 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 423340 | |
| n | 401510 | |
| t | 375302 | |
| u | 352655 | |
| y | 279783 | |
| e | 215178 | |
| a | 186491 | |
| r | 177010 | |
| l | 100058 | 3.4% |
| s | 96459 | 3.2% |
| Other values (23) | 368321 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 289740 | |
| B | 65415 | 11.5% |
| M | 27388 | 4.8% |
| S | 25040 | 4.4% |
| L | 22655 | 4.0% |
| P | 16991 | 3.0% |
| A | 16627 | 2.9% |
| H | 14879 | 2.6% |
| D | 12691 | 2.2% |
| W | 9829 | 1.7% |
| Other values (16) | 68939 | 12.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2609 | |
| ' | 1598 | |
| ? | 21 | 0.5% |
| , | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 298158 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 657 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3546301 | |
| Common | 303045 | 7.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 423340 | |
| n | 401510 | |
| t | 375302 | |
| u | 352655 | |
| C | 289740 | 8.2% |
| y | 279783 | 7.9% |
| e | 215178 | 6.1% |
| a | 186491 | 5.3% |
| r | 177010 | 5.0% |
| l | 100058 | 2.8% |
| Other values (49) | 745234 |
Common
| Value | Count | Frequency (%) |
| (space) | 298158 | |
| . | 2609 | 0.9% |
| ' | 1598 | 0.5% |
| - | 657 | 0.2% |
| ? | 21 | < 0.1% |
| , | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3848100 | |
| None | 1246 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 423340 | |
| n | 401510 | |
| t | 375302 | |
| u | 352655 | |
| (space) | 298158 | 7.7% |
| C | 289740 | 7.5% |
| y | 279783 | 7.3% |
| e | 215178 | 5.6% |
| a | 186491 | 4.8% |
| r | 177010 | 4.6% |
| Other values (48) | 848933 |
None
| Value | Count | Frequency (%) |
| ó | 851 | |
| é | 218 | 17.5% |
| í | 171 | 13.7% |
| á | 3 | 0.2% |
| ä | 1 | 0.1% |
| ñ | 1 | 0.1% |
| ú | 1 | 0.1% |
locality
Text
Missing 
| Distinct | 31755 |
|---|---|
| Distinct (%) | 19.4% |
| Missing | 560871 |
| Missing (%) | 77.4% |
| Memory size | 5.5 MiB |
Length
| Max length | 471 |
|---|---|
| Median length | 316 |
| Mean length | 59.79365302 |
| Min length | 1 |
Unique
| Unique | 21088 |
|---|---|
| Unique (%) | 12.9% |
Sample
| 1st row | St. Andrew Bay |
|---|---|
| 2nd row | Nuevitas Bay, Between Nuevitas And Pastelillo |
| 3rd row | Palos Verdes Hills; East side of Deadman's Island |
| 4th row | North slope of San Pedro Hills, ravine S of harbor City, 4200 feet N and 53.5 degrees E from 342-foot hill, 100 feet up ravine from end of Bellepoint Street (W98-30) |
| 5th row | Coyote Springs Valley; spring |
| Value | Count | Frequency (%) |
| of | 120156 | 7.0% |
| 34919 | 2.0% | |
| and | 22265 | 1.3% |
| bay | 19665 | 1.1% |
| the | 18421 | 1.1% |
| on | 17778 | 1.0% |
| from | 16823 | 1.0% |
| n | 16777 | 1.0% |
| feet | 15757 | 0.9% |
| river | 15334 | 0.9% |
| Other values (34131) | 1421831 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1556089 | 15.9% |
| e | 696401 | 7.1% |
| a | 667613 | 6.8% |
| o | 563197 | 5.8% |
| n | 459256 | 4.7% |
| t | 454549 | 4.6% |
| r | 411335 | 4.2% |
| i | 400968 | 4.1% |
| l | 325764 | 3.3% |
| s | 321160 | 3.3% |
| Other values (90) | 3928122 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5945227 | |
| Space Separator | 1556089 | 15.9% |
| Uppercase Letter | 1177808 | 12.0% |
| Decimal Number | 550644 | 5.6% |
| Other Punctuation | 394583 | 4.0% |
| Dash Punctuation | 53241 | 0.5% |
| Open Punctuation | 40436 | 0.4% |
| Close Punctuation | 40130 | 0.4% |
| Math Symbol | 26252 | 0.3% |
| Connector Punctuation | 35 | < 0.1% |
| Other values (2) | 9 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 696401 | |
| a | 667613 | |
| o | 563197 | 9.5% |
| n | 459256 | 7.7% |
| t | 454549 | 7.6% |
| r | 411335 | 6.9% |
| i | 400968 | 6.7% |
| l | 325764 | 5.5% |
| s | 321160 | 5.4% |
| f | 214183 | 3.6% |
| Other values (21) | 1430801 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 174300 | |
| C | 112455 | 9.5% |
| O | 84488 | 7.2% |
| N | 76065 | 6.5% |
| B | 74827 | 6.4% |
| R | 70201 | 6.0% |
| P | 66728 | 5.7% |
| A | 62185 | 5.3% |
| W | 51082 | 4.3% |
| T | 49504 | 4.2% |
| Other values (17) | 355973 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 179506 | |
| . | 103955 | |
| ; | 73054 | |
| / | 19087 | 4.8% |
| ' | 7147 | 1.8% |
| : | 4428 | 1.1% |
| # | 4037 | 1.0% |
| " | 1994 | 0.5% |
| ? | 703 | 0.2% |
| & | 599 | 0.2% |
| Other values (5) | 73 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 125210 | |
| 0 | 82093 | |
| 2 | 69469 | |
| 5 | 50957 | |
| 3 | 50931 | |
| 4 | 49415 | 9.0% |
| 6 | 36615 | 6.6% |
| 7 | 31244 | 5.7% |
| 8 | 27594 | 5.0% |
| 9 | 27116 | 4.9% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 22235 | |
| + | 2928 | 11.2% |
| = | 1045 | 4.0% |
| ± | 36 | 0.1% |
| ~ | 8 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 37729 | |
| { | 2081 | 5.1% |
| [ | 626 | 1.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 37422 | |
| } | 2082 | 5.2% |
| ] | 626 | 1.6% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 3 | |
| € | 2 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1556089 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 53241 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 35 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7123035 | |
| Common | 2661419 | 27.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 696401 | 9.8% |
| a | 667613 | 9.4% |
| o | 563197 | 7.9% |
| n | 459256 | 6.4% |
| t | 454549 | 6.4% |
| r | 411335 | 5.8% |
| i | 400968 | 5.6% |
| l | 325764 | 4.6% |
| s | 321160 | 4.5% |
| f | 214183 | 3.0% |
| Other values (48) | 2608609 |
Common
| Value | Count | Frequency (%) |
| (space) | 1556089 | |
| , | 179506 | 6.7% |
| 1 | 125210 | 4.7% |
| . | 103955 | 3.9% |
| 0 | 82093 | 3.1% |
| ; | 73054 | 2.7% |
| 2 | 69469 | 2.6% |
| - | 53241 | 2.0% |
| 5 | 50957 | 1.9% |
| 3 | 50931 | 1.9% |
| Other values (32) | 316914 | 11.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9784239 | |
| None | 213 | < 0.1% |
| Currency Symbols | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1556089 | 15.9% |
| e | 696401 | 7.1% |
| a | 667613 | 6.8% |
| o | 563197 | 5.8% |
| n | 459256 | 4.7% |
| t | 454549 | 4.6% |
| r | 411335 | 4.2% |
| i | 400968 | 4.1% |
| l | 325764 | 3.3% |
| s | 321160 | 3.3% |
| Other values (81) | 3927907 |
None
| Value | Count | Frequency (%) |
| ñ | 93 | |
| ± | 36 | 16.9% |
| Ã | 36 | 16.9% |
| í | 27 | 12.7% |
| á | 14 | 6.6% |
| ° | 4 | 1.9% |
| é | 2 | 0.9% |
| ö | 1 | 0.5% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 2 |
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 3.6% |
| Missing | 724311 |
| Missing (%) | > 99.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 88 |
|---|---|
| Median length | 88 |
| Mean length | 81.14720812 |
| Min length | 8 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | Elevation for Rampart Cave derived from Google Earth by Dr. Jim Mead on 4 Decemeber 2023 |
|---|---|
| 2nd row | Approx.450-500ft Above Base Of Fm |
| 3rd row | Elevation for Rampart Cave derived from Google Earth by Dr. Jim Mead on 4 Decemeber 2023 |
| 4th row | Elevation for Rampart Cave derived from Google Earth by Dr. Jim Mead on 4 Decemeber 2023 |
| 5th row | Elevation for Rampart Cave derived from Google Earth by Dr. Jim Mead on 4 Decemeber 2023 |
| Value | Count | Frequency (%) |
| elevation | 161 | 5.5% |
| by | 161 | 5.5% |
| 2023 | 161 | 5.5% |
| decemeber | 161 | 5.5% |
| 4 | 161 | 5.5% |
| mead | 161 | 5.5% |
| jim | 161 | 5.5% |
| dr | 161 | 5.5% |
| on | 161 | 5.5% |
| earth | 161 | 5.5% |
| Other values (38) | 1300 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 2713 | |
| e | 1696 | 10.6% |
| r | 1185 | 7.4% |
| o | 1092 | 6.8% |
| a | 1023 | 6.4% |
| m | 656 | 4.1% |
| t | 562 | 3.5% |
| v | 533 | 3.3% |
| i | 527 | 3.3% |
| d | 497 | 3.1% |
| Other values (45) | 5502 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10285 | |
| Space Separator | 2713 | 17.0% |
| Uppercase Letter | 1740 | 10.9% |
| Decimal Number | 968 | 6.1% |
| Other Punctuation | 239 | 1.5% |
| Math Symbol | 29 | 0.2% |
| Dash Punctuation | 12 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1696 | |
| r | 1185 | |
| o | 1092 | |
| a | 1023 | |
| m | 656 | 6.4% |
| t | 562 | 5.5% |
| v | 533 | 5.2% |
| i | 527 | 5.1% |
| d | 497 | 4.8% |
| n | 407 | 4.0% |
| Other values (13) | 2107 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 322 | |
| E | 322 | |
| C | 194 | |
| M | 185 | |
| J | 161 | |
| G | 161 | |
| R | 161 | |
| A | 64 | 3.7% |
| B | 53 | 3.0% |
| O | 25 | 1.4% |
| Other values (8) | 92 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 354 | |
| 0 | 209 | |
| 4 | 173 | |
| 3 | 161 | |
| 5 | 40 | 4.1% |
| 1 | 25 | 2.6% |
| 6 | 5 | 0.5% |
| 8 | 1 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 196 | |
| , | 42 | 17.6% |
| / | 1 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2713 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 29 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12025 | |
| Common | 3961 | 24.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1696 | |
| r | 1185 | 9.9% |
| o | 1092 | 9.1% |
| a | 1023 | 8.5% |
| m | 656 | 5.5% |
| t | 562 | 4.7% |
| v | 533 | 4.4% |
| i | 527 | 4.4% |
| d | 497 | 4.1% |
| n | 407 | 3.4% |
| Other values (31) | 3847 |
Common
| Value | Count | Frequency (%) |
| (space) | 2713 | |
| 2 | 354 | 8.9% |
| 0 | 209 | 5.3% |
| . | 196 | 4.9% |
| 4 | 173 | 4.4% |
| 3 | 161 | 4.1% |
| , | 42 | 1.1% |
| 5 | 40 | 1.0% |
| + | 29 | 0.7% |
| 1 | 25 | 0.6% |
| Other values (4) | 19 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15986 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 2713 | |
| e | 1696 | 10.6% |
| r | 1185 | 7.4% |
| o | 1092 | 6.8% |
| a | 1023 | 6.4% |
| m | 656 | 4.1% |
| t | 562 | 3.5% |
| v | 533 | 3.3% |
| i | 527 | 3.3% |
| d | 497 | 3.1% |
| Other values (45) | 5502 |
verbatimDepth
Text
Missing 
| Distinct | 17 |
|---|---|
| Distinct (%) | 20.2% |
| Missing | 724424 |
| Missing (%) | > 99.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 10 |
| Mean length | 5.523809524 |
| Min length | 4 |
Unique
| Unique | 9 |
|---|---|
| Unique (%) | 10.7% |
Sample
| 1st row | reef |
|---|---|
| 2nd row | Beach |
| 3rd row | ?48 Ms |
| 4th row | Beach |
| 5th row | Intertidal |
| Value | Count | Frequency (%) |
| reef | 30 | |
| beach | 25 | |
| low | 9 | 8.3% |
| ms | 8 | 7.3% |
| water | 7 | 6.4% |
| 48 | 6 | 5.5% |
| no.4 | 4 | 3.7% |
| mnb | 3 | 2.8% |
| 57ms | 2 | 1.8% |
| 25 | 2 | 1.8% |
| Other values (12) | 13 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 96 | |
| r | 40 | 8.6% |
| a | 37 | 8.0% |
| f | 31 | 6.7% |
| c | 26 | 5.6% |
| h | 25 | 5.4% |
| (space) | 25 | 5.4% |
| b | 18 | 3.9% |
| o | 13 | 2.8% |
| t | 13 | 2.8% |
| Other values (30) | 140 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 339 | |
| Uppercase Letter | 51 | 11.0% |
| Decimal Number | 32 | 6.9% |
| Space Separator | 25 | 5.4% |
| Other Punctuation | 16 | 3.4% |
| Dash Punctuation | 1 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 96 | |
| r | 40 | |
| a | 37 | 10.9% |
| f | 31 | 9.1% |
| c | 26 | 7.7% |
| h | 25 | 7.4% |
| b | 18 | 5.3% |
| o | 13 | 3.8% |
| t | 13 | 3.8% |
| s | 10 | 2.9% |
| Other values (7) | 30 | 8.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 12 | |
| B | 10 | |
| L | 9 | |
| W | 8 | |
| N | 4 | 7.8% |
| F | 2 | 3.9% |
| A | 1 | 2.0% |
| S | 1 | 2.0% |
| U | 1 | 2.0% |
| C | 1 | 2.0% |
| Other values (2) | 2 | 3.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 11 | |
| 8 | 8 | |
| 5 | 4 | 12.5% |
| 7 | 3 | 9.4% |
| 0 | 3 | 9.4% |
| 2 | 2 | 6.2% |
| 3 | 1 | 3.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 10 | |
| ? | 6 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 25 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 390 | |
| Common | 74 | 15.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 96 | |
| r | 40 | |
| a | 37 | 9.5% |
| f | 31 | 7.9% |
| c | 26 | 6.7% |
| h | 25 | 6.4% |
| b | 18 | 4.6% |
| o | 13 | 3.3% |
| t | 13 | 3.3% |
| M | 12 | 3.1% |
| Other values (19) | 79 |
Common
| Value | Count | Frequency (%) |
| (space) | 25 | |
| 4 | 11 | |
| . | 10 | 13.5% |
| 8 | 8 | 10.8% |
| ? | 6 | 8.1% |
| 5 | 4 | 5.4% |
| 7 | 3 | 4.1% |
| 0 | 3 | 4.1% |
| 2 | 2 | 2.7% |
| - | 1 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 464 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 96 | |
| r | 40 | 8.6% |
| a | 37 | 8.0% |
| f | 31 | 6.7% |
| c | 26 | 5.6% |
| h | 25 | 5.4% |
| (space) | 25 | 5.4% |
| b | 18 | 3.9% |
| o | 13 | 2.8% |
| t | 13 | 2.8% |
| Other values (30) | 140 |
decimalLatitude
Real number (ℝ)
Missing 
| Distinct | 34309 |
|---|---|
| Distinct (%) | 33.0% |
| Missing | 620570 |
| Missing (%) | 85.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36.17761578 |
| Minimum | -77.9033 |
|---|---|
| Maximum | 89.13 |
| Zeros | 12 |
| Zeros (%) | < 0.1% |
| Negative | 5725 |
| Negative (%) | 0.8% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | -77.9033 |
|---|---|
| 5-th percentile | -9.0417 |
| Q1 | 30.2267 |
| median | 37.54725 |
| Q3 | 45.743025 |
| 95-th percentile | 59.895255 |
| Maximum | 89.13 |
| Range | 167.0333 |
| Interquartile range (IQR) | 15.516325 |
Descriptive statistics
| Standard deviation | 18.98229075 |
|---|---|
| Coefficient of variation (CV) | 0.5246971185 |
| Kurtosis | 4.688030722 |
| Mean | 36.17761578 |
| Median Absolute Deviation (MAD) | 7.40415 |
| Skewness | -1.613703618 |
| Sum | 3760229.028 |
| Variance | 360.3273622 |
| Monotonicity | Not monotonic |
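The decimalLatitude summary figures above can be cross-checked against one another with a little arithmetic. The following is a minimal sketch, not part of the profiling output: it assumes the usual definitions (range = maximum - minimum, IQR = Q3 - Q1, variance = standard deviation squared, CV = standard deviation / mean, mean = sum / non-missing count), and the variable names are illustrative only.

```python
import math

# Values copied from the decimalLatitude tables above.
n_rows = 724508          # number of observations in the dataset
n_missing = 620570       # records with no decimalLatitude
minimum, maximum = -77.9033, 89.13
q1, q3 = 30.2267, 45.743025
mean = 36.17761578
std = 18.98229075
total = 3760229.028      # the "Sum" row

# Re-derive the dependent statistics under the assumed definitions.
n_present = n_rows - n_missing
value_range = maximum - minimum
iqr = q3 - q1
variance = std ** 2
cv = std / mean
mean_from_sum = total / n_present

# Compare each derived value with the figure printed in the report.
checks = {
    "range": math.isclose(value_range, 167.0333, abs_tol=1e-6),
    "iqr": math.isclose(iqr, 15.516325, abs_tol=1e-6),
    "variance": math.isclose(variance, 360.3273622, rel_tol=1e-6),
    "cv": math.isclose(cv, 0.5246971185, rel_tol=1e-6),
    "mean": math.isclose(mean_from_sum, mean, rel_tol=1e-6),
}
print(n_present, checks)
```

The same check applies to decimalLongitude, where the negative mean is what makes the reported coefficient of variation negative.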
| Value | Count | Frequency (%) |
| 44.6458 | 1686 | 0.2% |
| 17.5 | 673 | 0.1% |
| 29.8119 | 329 | < 0.1% |
| 33.1767 | 323 | < 0.1% |
| 34.6405 | 307 | < 0.1% |
| 38.8295 | 287 | < 0.1% |
| 41.1458 | 279 | < 0.1% |
| 48.1104 | 243 | < 0.1% |
| 40.6184 | 235 | < 0.1% |
| 31.6767 | 227 | < 0.1% |
| Other values (34299) | 99349 | 13.7% |
| (Missing) | 620570 |
| Value | Count | Frequency (%) |
| -77.9033 | 5 | < 0.1% |
| -77.58 | 1 | < 0.1% |
| -77.57 | 5 | < 0.1% |
| -77.5 | 15 | |
| -76.98 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 89.13 | 3 | < 0.1% |
| 88.7817 | 9 | |
| 88.515 | 7 | |
| 88.0367 | 7 | |
| 87.75 | 7 |
decimalLongitude
Real number (ℝ)
Missing 
| Distinct | 35343 |
|---|---|
| Distinct (%) | 34.0% |
| Missing | 620570 |
| Missing (%) | 85.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -84.45552615 |
| Minimum | -179.57 |
|---|---|
| Maximum | 179.8 |
| Zeros | 19 |
| Zeros (%) | < 0.1% |
| Negative | 95623 |
| Negative (%) | 13.2% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | -179.57 |
|---|---|
| 5-th percentile | -156.775 |
| Q1 | -122.706 |
| median | -87.3072 |
| Q3 | -75.610425 |
| 95-th percentile | 88.7181 |
| Maximum | 179.8 |
| Range | 359.37 |
| Interquartile range (IQR) | 47.095575 |
Descriptive statistics
| Standard deviation | 63.087641 |
|---|---|
| Coefficient of variation (CV) | -0.7469924571 |
| Kurtosis | 5.28951012 |
| Mean | -84.45552615 |
| Median Absolute Deviation (MAD) | 17.5088 |
| Skewness | 2.138850984 |
| Sum | -8778138.477 |
| Variance | 3980.050447 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -123.908 | 1686 | 0.2% |
| -95.0833 | 673 | 0.1% |
| -103.252 | 329 | < 0.1% |
| -98.6878 | 321 | < 0.1% |
| -105.851 | 307 | < 0.1% |
| -76.8473 | 287 | < 0.1% |
| -115.358 | 279 | < 0.1% |
| -123.934 | 243 | < 0.1% |
| -108.207 | 235 | < 0.1% |
| -123.18 | 230 | < 0.1% |
| Other values (35333) | 99348 | 13.7% |
| (Missing) | 620570 |
| Value | Count | Frequency (%) |
| -179.57 | 1 | < 0.1% |
| -179.556 | 12 | |
| -179.555 | 4 | < 0.1% |
| -179.55 | 4 | < 0.1% |
| -179 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 179.8 | 1 | |
| 179.58 | 1 | |
| 179.5 | 1 | |
| 179.137 | 2 | |
| 179.08 | 2 |
verbatimCoordinateSystem
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 654265 |
| Missing (%) | 90.3% |
| Memory size | 5.5 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 23 |
| Min length | 23 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Degrees Minutes Seconds |
|---|---|
| 2nd row | Degrees Minutes Seconds |
| 3rd row | Degrees Minutes Seconds |
| 4th row | Degrees Minutes Seconds |
| 5th row | Degrees Minutes Seconds |
| Value | Count | Frequency (%) |
| degrees | 70243 | |
| minutes | 70243 | |
| seconds | 70243 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 351215 | |
| s | 210729 | |
| (space) | 140486 | 8.7% |
| n | 140486 | 8.7% |
| D | 70243 | 4.3% |
| g | 70243 | 4.3% |
| r | 70243 | 4.3% |
| M | 70243 | 4.3% |
| i | 70243 | 4.3% |
| u | 70243 | 4.3% |
| Other values (5) | 351215 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1264374 | |
| Uppercase Letter | 210729 | 13.0% |
| Space Separator | 140486 | 8.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 351215 | |
| s | 210729 | |
| n | 140486 | 11.1% |
| g | 70243 | 5.6% |
| r | 70243 | 5.6% |
| i | 70243 | 5.6% |
| u | 70243 | 5.6% |
| t | 70243 | 5.6% |
| c | 70243 | 5.6% |
| o | 70243 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 70243 | |
| M | 70243 | |
| S | 70243 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 140486 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1475103 | |
| Common | 140486 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 351215 | |
| s | 210729 | |
| n | 140486 | 9.5% |
| D | 70243 | 4.8% |
| g | 70243 | 4.8% |
| r | 70243 | 4.8% |
| M | 70243 | 4.8% |
| i | 70243 | 4.8% |
| u | 70243 | 4.8% |
| t | 70243 | 4.8% |
| Other values (4) | 280972 |
Common
| Value | Count | Frequency (%) |
| (space) | 140486 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1615589 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 351215 | |
| s | 210729 | |
| (space) | 140486 | 8.7% |
| n | 140486 | 8.7% |
| D | 70243 | 4.3% |
| g | 70243 | 4.3% |
| r | 70243 | 4.3% |
| M | 70243 | 4.3% |
| i | 70243 | 4.3% |
| u | 70243 | 4.3% |
| Other values (5) | 351215 |
Missing 
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 695012 |
| Missing (%) | 95.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 81 |
|---|---|
| Median length | 43 |
| Mean length | 42.23633713 |
| Min length | 7 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Georeferencing Quick Reference Guide (2020) |
|---|---|
| 2nd row | Georeferencing Quick Reference Guide (2020) |
| 3rd row | Georeferencing Quick Reference Guide (2020) |
| 4th row | Georeferencing Quick Reference Guide (2020) |
| 5th row | Georeferencing Quick Reference Guide (2020) |
| Value | Count | Frequency (%) |
| georeferencing | 26344 | |
| guide | 26344 | |
| reference | 24178 | |
| 2020 | 24178 | |
| quick | 24178 | |
| biogeomancer | 2166 | 1.4% |
| 2006 | 2166 | 1.4% |
| august | 2166 | 1.4% |
| consortium | 2166 | 1.4% |
| for | 2166 | 1.4% |
| Other values (32) | 13421 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 237471 | |
| (space) | 119977 | 9.6% |
| r | 87730 | 7.0% |
| i | 84069 | 6.7% |
| n | 82720 | 6.6% |
| c | 81302 | 6.5% |
| u | 58822 | 4.7% |
| G | 54854 | 4.4% |
| 0 | 52731 | 4.2% |
| f | 52688 | 4.2% |
| Other values (40) | 333439 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 844245 | |
| Uppercase Letter | 121633 | 9.8% |
| Space Separator | 119977 | 9.6% |
| Decimal Number | 105634 | 8.5% |
| Open Punctuation | 24178 | 1.9% |
| Close Punctuation | 24178 | 1.9% |
| Other Punctuation | 5915 | 0.5% |
| Math Symbol | 43 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 237471 | |
| r | 87730 | 10.4% |
| i | 84069 | 10.0% |
| n | 82720 | 9.8% |
| c | 81302 | 9.6% |
| u | 58822 | 7.0% |
| f | 52688 | 6.2% |
| o | 40962 | 4.9% |
| g | 28625 | 3.4% |
| d | 28111 | 3.3% |
| Other values (12) | 61745 | 7.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 54854 | |
| Q | 25508 | |
| R | 24645 | |
| B | 4332 | 3.6% |
| A | 3450 | 2.8% |
| C | 2537 | 2.1% |
| P | 2195 | 1.8% |
| M | 1338 | 1.1% |
| L | 1299 | 1.1% |
| V | 351 | 0.3% |
| Other values (6) | 1124 | 0.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 52731 | |
| 2 | 50522 | |
| 6 | 2166 | 2.1% |
| 5 | 129 | 0.1% |
| 4 | 43 | < 0.1% |
| 8 | 43 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3205 | |
| , | 2710 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 119977 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 24178 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 24178 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 43 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 965878 | |
| Common | 279925 | 22.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 237471 | |
| r | 87730 | 9.1% |
| i | 84069 | 8.7% |
| n | 82720 | 8.6% |
| c | 81302 | 8.4% |
| u | 58822 | 6.1% |
| G | 54854 | 5.7% |
| f | 52688 | 5.5% |
| o | 40962 | 4.2% |
| g | 28625 | 3.0% |
| Other values (28) | 156635 |
Common
| Value | Count | Frequency (%) |
| (space) | 119977 | |
| 0 | 52731 | |
| 2 | 50522 | |
| ( | 24178 | 8.6% |
| ) | 24178 | 8.6% |
| . | 3205 | 1.1% |
| , | 2710 | 1.0% |
| 6 | 2166 | 0.8% |
| 5 | 129 | < 0.1% |
| 4 | 43 | < 0.1% |
| Other values (2) | 86 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1245803 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 237471 | |
| (space) | 119977 | 9.6% |
| r | 87730 | 7.0% |
| i | 84069 | 6.7% |
| n | 82720 | 6.6% |
| c | 81302 | 6.5% |
| u | 58822 | 4.7% |
| G | 54854 | 4.4% |
| 0 | 52731 | 4.2% |
| f | 52688 | 4.2% |
| Other values (40) | 333439 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 40.0% |
| Missing | 724503 |
| Missing (%) | > 99.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 70 |
|---|---|
| Median length | 70 |
| Mean length | 58 |
| Min length | 10 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 20.0% |
Sample
| 1st row | A; B; C; D |
|---|---|
| 2nd row | included in Jennifer Jett's Foram Bulk DB but not included in F Ledger |
| 3rd row | included in Jennifer Jett's Foram Bulk DB but not included in F Ledger |
| 4th row | included in Jennifer Jett's Foram Bulk DB but not included in F Ledger |
| 5th row | included in Jennifer Jett's Foram Bulk DB but not included in F Ledger |
| Value | Count | Frequency (%) |
| included | 8 | |
| in | 8 | |
| jennifer | 4 | |
| jett's | 4 | |
| foram | 4 | |
| bulk | 4 | |
| db | 4 | |
| but | 4 | |
| not | 4 | |
| f | 4 | |
| Other values (5) | 8 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 51 | |
| n | 28 | 9.7% |
| e | 28 | 9.7% |
| i | 20 | 6.9% |
| d | 20 | 6.9% |
| u | 16 | 5.5% |
| t | 16 | 5.5% |
| r | 12 | 4.1% |
| l | 12 | 4.1% |
| B | 9 | 3.1% |
| Other values (17) | 78 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 196 | |
| Space Separator | 51 | 17.6% |
| Uppercase Letter | 36 | 12.4% |
| Other Punctuation | 7 | 2.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 28 | |
| e | 28 | |
| i | 20 | |
| d | 20 | |
| u | 16 | |
| t | 16 | |
| r | 12 | |
| l | 12 | |
| c | 8 | 4.1% |
| o | 8 | 4.1% |
| Other values (7) | 28 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 9 | |
| J | 8 | |
| F | 8 | |
| D | 5 | |
| L | 4 | |
| A | 1 | 2.8% |
| C | 1 | 2.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 4 | |
| ; | 3 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 51 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 232 | |
| Common | 58 | 20.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 28 | |
| e | 28 | |
| i | 20 | 8.6% |
| d | 20 | 8.6% |
| u | 16 | 6.9% |
| t | 16 | 6.9% |
| r | 12 | 5.2% |
| l | 12 | 5.2% |
| B | 9 | 3.9% |
| J | 8 | 3.4% |
| Other values (14) | 63 |
Common
| Value | Count | Frequency (%) |
| (space) | 51 | |
| ' | 4 | 6.9% |
| ; | 3 | 5.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 290 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 51 | |
| n | 28 | 9.7% |
| e | 28 | 9.7% |
| i | 20 | 6.9% |
| d | 20 | 6.9% |
| u | 16 | 5.5% |
| t | 16 | 5.5% |
| r | 12 | 4.1% |
| l | 12 | 4.1% |
| B | 9 | 3.1% |
| Other values (17) | 78 |
earliestEraOrLowestErathem
Text
Missing 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 220036 |
| Missing (%) | 30.4% |
| Memory size | 5.5 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 8 |
| Mean length | 8.387123567 |
| Min length | 8 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Mesozoic |
|---|---|
| 2nd row | Cenozoic |
| 3rd row | Cenozoic |
| 4th row | Paleozoic |
| 5th row | Cenozoic |
| Value | Count | Frequency (%) |
| cenozoic | 261752 | |
| paleozoic | 194023 | |
| mesozoic | 48343 | 9.6% |
| precambrian | 298 | 0.1% |
| mesoproterozoic | 41 | < 0.1% |
| neoproterozoic | 7 | < 0.1% |
| paleoproterozoic | 4 | < 0.1% |
| paleoarchean | 3 | < 0.1% |
| mesoarchean | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1008448 | |
| e | 504528 | |
| c | 504472 | |
| i | 504468 | |
| z | 504170 | |
| n | 262054 | 6.2% |
| C | 261752 | 6.2% |
| a | 194634 | 4.6% |
| P | 194327 | 4.6% |
| l | 194030 | 4.6% |
| Other values (9) | 98186 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3726598 | |
| Uppercase Letter | 504471 | 11.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 1008448 | |
| e | 504528 | |
| c | 504472 | |
| i | 504468 | |
| z | 504170 | |
| n | 262054 | 7.0% |
| a | 194634 | 5.2% |
| l | 194030 | 5.2% |
| s | 48385 | 1.3% |
| r | 704 | < 0.1% |
| Other values (5) | 705 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 261752 | |
| P | 194327 | |
| M | 48385 | 9.6% |
| N | 7 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4231069 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 1008448 | |
| e | 504528 | |
| c | 504472 | |
| i | 504468 | |
| z | 504170 | |
| n | 262054 | 6.2% |
| C | 261752 | 6.2% |
| a | 194634 | 4.6% |
| P | 194327 | 4.6% |
| l | 194030 | 4.6% |
| Other values (9) | 98186 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4231069 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 1008448 | |
| e | 504528 | |
| c | 504472 | |
| i | 504468 | |
| z | 504170 | |
| n | 262054 | 6.2% |
| C | 261752 | 6.2% |
| a | 194634 | 4.6% |
| P | 194327 | 4.6% |
| l | 194030 | 4.6% |
| Other values (9) | 98186 | 2.3% |
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 718163 |
| Missing (%) | 99.1% |
| Memory size | 5.5 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 8 |
| Mean length | 8.134121355 |
| Min length | 8 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Paleozoic |
|---|---|
| 2nd row | Cenozoic |
| 3rd row | Mesozoic |
| 4th row | Cenozoic |
| 5th row | Cenozoic |
| Value | Count | Frequency (%) |
| cenozoic | 5229 | |
| paleozoic | 826 | 13.0% |
| mesozoic | 286 | 4.5% |
| neoproterozoic | 3 | < 0.1% |
| mesoproterozoic | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 12698 | |
| e | 6349 | |
| z | 6345 | |
| i | 6345 | |
| c | 6345 | |
| C | 5229 | |
| n | 5229 | |
| P | 826 | 1.6% |
| a | 826 | 1.6% |
| l | 826 | 1.6% |
| Other values (6) | 593 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 45266 | |
| Uppercase Letter | 6345 | 12.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 12698 | 28.1% |
| e | 6349 | 14.0% |
| z | 6345 | 14.0% |
| i | 6345 | 14.0% |
| c | 6345 | 14.0% |
| n | 5229 | 11.6% |
| a | 826 | 1.8% |
| l | 826 | 1.8% |
| s | 287 | 0.6% |
| r | 8 | < 0.1% |
| Other values (2) | 8 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 5229 | 82.4% |
| P | 826 | 13.0% |
| M | 287 | 4.5% |
| N | 3 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 51611 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 12698 | 24.6% |
| e | 6349 | 12.3% |
| z | 6345 | 12.3% |
| i | 6345 | 12.3% |
| c | 6345 | 12.3% |
| C | 5229 | 10.1% |
| n | 5229 | 10.1% |
| P | 826 | 1.6% |
| a | 826 | 1.6% |
| l | 826 | 1.6% |
| Other values (6) | 593 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 51611 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 12698 | 24.6% |
| e | 6349 | 12.3% |
| z | 6345 | 12.3% |
| i | 6345 | 12.3% |
| c | 6345 | 12.3% |
| C | 5229 | 10.1% |
| n | 5229 | 10.1% |
| P | 826 | 1.6% |
| a | 826 | 1.6% |
| l | 826 | 1.6% |
| Other values (6) | 593 | 1.1% |
earliestPeriodOrLowestSystem
Text
Missing 
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 245750 |
| Missing (%) | 33.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 8.607453035 |
| Min length | 6 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Triassic |
|---|---|
| 2nd row | Paleogene |
| 3rd row | Neogene |
| 4th row | Permian |
| 5th row | Quaternary |
| Value | Count | Frequency (%) |
| paleogene | 90464 | 18.9% |
| neogene | 72075 | 15.1% |
| cambrian | 48808 | 10.2% |
| recent | 41336 | 8.6% |
| ordovician | 34462 | 7.2% |
| cretaceous | 34238 | 7.2% |
| permian | 32455 | 6.8% |
| quaternary | 27798 | 5.8% |
| devonian | 27637 | 5.8% |
| mississippian | 19734 | 4.1% |
| Other values (14) | 49751 | 10.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 751141 | 18.2% |
| n | 506768 | 12.3% |
| a | 458678 | 11.1% |
| i | 322536 | 7.8% |
| o | 263741 | 6.4% |
| r | 242986 | 5.9% |
| g | 162539 | 3.9% |
| s | 160613 | 3.9% |
| P | 140533 | 3.4% |
| c | 124669 | 3.0% |
| Other values (25) | 986683 | 23.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3642156 | 88.4% |
| Uppercase Letter | 478731 | 11.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 751141 | 20.6% |
| n | 506768 | 13.9% |
| a | 458678 | 12.6% |
| i | 322536 | 8.9% |
| o | 263741 | 7.2% |
| r | 242986 | 6.7% |
| g | 162539 | 4.5% |
| s | 160613 | 4.4% |
| c | 124669 | 3.4% |
| l | 120100 | 3.3% |
| Other values (11) | 528385 | 14.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 140533 | 29.4% |
| C | 84743 | 17.7% |
| N | 72075 | 15.1% |
| R | 41337 | 8.6% |
| O | 34462 | 7.2% |
| Q | 27798 | 5.8% |
| D | 27637 | 5.8% |
| M | 20068 | 4.2% |
| S | 11625 | 2.4% |
| T | 9097 | 1.9% |
| Other values (4) | 9356 | 2.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4120887 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 751141 | 18.2% |
| n | 506768 | 12.3% |
| a | 458678 | 11.1% |
| i | 322536 | 7.8% |
| o | 263741 | 6.4% |
| r | 242986 | 5.9% |
| g | 162539 | 3.9% |
| s | 160613 | 3.9% |
| P | 140533 | 3.4% |
| c | 124669 | 3.0% |
| Other values (25) | 986683 | 23.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4120887 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 751141 | 18.2% |
| n | 506768 | 12.3% |
| a | 458678 | 11.1% |
| i | 322536 | 7.8% |
| o | 263741 | 6.4% |
| r | 242986 | 5.9% |
| g | 162539 | 3.9% |
| s | 160613 | 3.9% |
| P | 140533 | 3.4% |
| c | 124669 | 3.0% |
| Other values (25) | 986683 | 23.9% |
latestPeriodOrHighestSystem
Text
Missing 
| Distinct | 15 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 718167 |
| Missing (%) | 99.1% |
| Memory size | 5.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 8.077905693 |
| Min length | 6 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Devonian |
|---|---|
| 2nd row | Neogene |
| 3rd row | Cretaceous |
| 4th row | Quaternary |
| 5th row | Recent |
| Value | Count | Frequency (%) |
| neogene | 3161 | 49.9% |
| paleogene | 1404 | 22.1% |
| quaternary | 668 | 10.5% |
| devonian | 416 | 6.6% |
| cretaceous | 185 | 2.9% |
| cambrian | 161 | 2.5% |
| ordovician | 137 | 2.2% |
| pennsylvanian | 77 | 1.2% |
| recent | 60 | 0.9% |
| silurian | 30 | 0.5% |
| Other values (5) | 42 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 15352 | 30.0% |
| n | 6768 | 13.2% |
| o | 5307 | 10.4% |
| g | 4565 | 8.9% |
| a | 4026 | 7.9% |
| N | 3161 | 6.2% |
| r | 1892 | 3.7% |
| l | 1511 | 2.9% |
| P | 1484 | 2.9% |
| i | 1053 | 2.1% |
| Other values (18) | 6103 | 11.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 44881 | 87.6% |
| Uppercase Letter | 6341 | 12.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 15352 | 34.2% |
| n | 6768 | 15.1% |
| o | 5307 | 11.8% |
| g | 4565 | 10.2% |
| a | 4026 | 9.0% |
| r | 1892 | 4.2% |
| l | 1511 | 3.4% |
| i | 1053 | 2.3% |
| t | 914 | 2.0% |
| u | 898 | 2.0% |
| Other values (8) | 2595 | 5.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 3161 | 49.9% |
| P | 1484 | 23.4% |
| Q | 668 | 10.5% |
| D | 416 | 6.6% |
| C | 348 | 5.5% |
| O | 137 | 2.2% |
| R | 60 | 0.9% |
| S | 31 | 0.5% |
| T | 23 | 0.4% |
| J | 13 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 51222 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 15352 | 30.0% |
| n | 6768 | 13.2% |
| o | 5307 | 10.4% |
| g | 4565 | 8.9% |
| a | 4026 | 7.9% |
| N | 3161 | 6.2% |
| r | 1892 | 3.7% |
| l | 1511 | 2.9% |
| P | 1484 | 2.9% |
| i | 1053 | 2.1% |
| Other values (18) | 6103 | 11.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 51222 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 15352 | 30.0% |
| n | 6768 | 13.2% |
| o | 5307 | 10.4% |
| g | 4565 | 8.9% |
| a | 4026 | 7.9% |
| N | 3161 | 6.2% |
| r | 1892 | 3.7% |
| l | 1511 | 2.9% |
| P | 1484 | 2.9% |
| i | 1053 | 2.1% |
| Other values (18) | 6103 | 11.9% |
earliestEpochOrLowestSeries
Text
Missing 
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 376914 |
| Missing (%) | 52.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 6.357434248 |
| Min length | 1 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Middle |
|---|---|
| 2nd row | Eocene |
| 3rd row | Pliocene |
| 4th row | Pleistocene |
| 5th row | Early |
| Value | Count | Frequency (%) |
| middle | 68576 | 19.7% |
| eocene | 66980 | 19.3% |
| late | 57993 | 16.7% |
| miocene | 39410 | 11.3% |
| early | 37474 | 10.8% |
| pliocene | 32039 | 9.2% |
| pleistocene | 20013 | 5.8% |
| oligocene | 15521 | 4.5% |
| paleocene | 7752 | 2.2% |
| holocene | 1481 | 0.4% |
| Other values (10) | 355 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 520801 | 23.6% |
| o | 184703 | 8.4% |
| n | 183525 | 8.3% |
| c | 183200 | 8.3% |
| l | 183151 | 8.3% |
| i | 175926 | 8.0% |
| d | 137364 | 6.2% |
| M | 107985 | 4.9% |
| E | 104453 | 4.7% |
| a | 104017 | 4.7% |
| Other values (22) | 324681 | 14.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1862169 | 84.3% |
| Uppercase Letter | 347612 | 15.7% |
| Other Punctuation | 25 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 520801 | 28.0% |
| o | 184703 | 9.9% |
| n | 183525 | 9.9% |
| c | 183200 | 9.8% |
| l | 183151 | 9.8% |
| i | 175926 | 9.4% |
| d | 137364 | 7.4% |
| a | 104017 | 5.6% |
| t | 78031 | 4.2% |
| r | 37590 | 2.0% |
| Other values (9) | 73861 | 4.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 107985 | 31.1% |
| E | 104453 | 30.0% |
| P | 59809 | 17.2% |
| L | 58036 | 16.7% |
| O | 15517 | 4.5% |
| H | 1481 | 0.4% |
| G | 195 | 0.1% |
| C | 77 | < 0.1% |
| D | 27 | < 0.1% |
| U | 25 | < 0.1% |
| Other values (2) | 7 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 25 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2209781 | > 99.9% |
| Common | 25 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 520801 | 23.6% |
| o | 184703 | 8.4% |
| n | 183525 | 8.3% |
| c | 183200 | 8.3% |
| l | 183151 | 8.3% |
| i | 175926 | 8.0% |
| d | 137364 | 6.2% |
| M | 107985 | 4.9% |
| E | 104453 | 4.7% |
| a | 104017 | 4.7% |
| Other values (21) | 324656 | 14.7% |
Common
| Value | Count | Frequency (%) |
| / | 25 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2209806 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 520801 | 23.6% |
| o | 184703 | 8.4% |
| n | 183525 | 8.3% |
| c | 183200 | 8.3% |
| l | 183151 | 8.3% |
| i | 175926 | 8.0% |
| d | 137364 | 6.2% |
| M | 107985 | 4.9% |
| E | 104453 | 4.7% |
| a | 104017 | 4.7% |
| Other values (22) | 324681 | 14.7% |
latestEpochOrHighestSeries
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 718290 |
| Missing (%) | 99.1% |
| Memory size | 5.5 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 7.33708588 |
| Min length | 4 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Middle |
|---|---|
| 2nd row | Pliocene |
| 3rd row | Late |
| 4th row | Pleistocene |
| 5th row | Miocene |
| Value | Count | Frequency (%) |
| pliocene | 2384 | 38.3% |
| eocene | 1075 | 17.3% |
| miocene | 759 | 12.2% |
| late | 645 | 10.4% |
| pleistocene | 645 | 10.4% |
| middle | 364 | 5.9% |
| oligocene | 188 | 3.0% |
| paleocene | 97 | 1.6% |
| early | 34 | 0.5% |
| holocene | 14 | 0.2% |
| Other values (2) | 13 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 12099 | 26.5% |
| o | 5177 | 11.3% |
| n | 5176 | 11.3% |
| c | 5174 | 11.3% |
| i | 4342 | 9.5% |
| l | 3726 | 8.2% |
| P | 3126 | 6.9% |
| t | 1302 | 2.9% |
| M | 1123 | 2.5% |
| E | 1109 | 2.4% |
| Other values (11) | 3268 | 7.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 39404 | 86.4% |
| Uppercase Letter | 6218 | 13.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 12099 | 30.7% |
| o | 5177 | 13.1% |
| n | 5176 | 13.1% |
| c | 5174 | 13.1% |
| i | 4342 | 11.0% |
| l | 3726 | 9.5% |
| t | 1302 | 3.3% |
| a | 777 | 2.0% |
| d | 728 | 1.8% |
| s | 645 | 1.6% |
| Other values (4) | 258 | 0.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 3126 | 50.3% |
| M | 1123 | 18.1% |
| E | 1109 | 17.8% |
| L | 646 | 10.4% |
| O | 188 | 3.0% |
| H | 14 | 0.2% |
| R | 12 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 45622 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 12099 | 26.5% |
| o | 5177 | 11.3% |
| n | 5176 | 11.3% |
| c | 5174 | 11.3% |
| i | 4342 | 9.5% |
| l | 3726 | 8.2% |
| P | 3126 | 6.9% |
| t | 1302 | 2.9% |
| M | 1123 | 2.5% |
| E | 1109 | 2.4% |
| Other values (11) | 3268 | 7.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 45622 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 12099 | 26.5% |
| o | 5177 | 11.3% |
| n | 5176 | 11.3% |
| c | 5174 | 11.3% |
| i | 4342 | 9.5% |
| l | 3726 | 8.2% |
| P | 3126 | 6.9% |
| t | 1302 | 2.9% |
| M | 1123 | 2.5% |
| E | 1109 | 2.4% |
| Other values (11) | 3268 | 7.2% |
earliestAgeOrLowestStage
Text
Missing
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 562472 |
| Missing (%) | 77.6% |
| Memory size | 5.5 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 19 |
| Mean length | 9.036053716 |
| Min length | 4 |
Unique
| Unique | 38 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Anisian |
|---|---|
| 2nd row | Hemphillian |
| 3rd row | Middle |
| 4th row | Emsian |
| 5th row | Irvingtonian |
| Value | Count | Frequency (%) |
| hemphillian | 19681 | 12.1% |
| middle | 17380 | 10.7% |
| wasatchian | 7037 | 4.3% |
| early | 5466 | 3.4% |
| orellan | 5085 | 3.1% |
| bridgerian | 4799 | 2.9% |
| maastrichtian | 4686 | 2.9% |
| campanian | 4051 | 2.5% |
| chadronian | 3871 | 2.4% |
| ypresian | 3476 | 2.1% |
| Other values (350) | 87399 | 53.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 228885 | 15.6% |
| n | 195907 | 13.4% |
| i | 190767 | 13.0% |
| e | 105142 | 7.2% |
| l | 96307 | 6.6% |
| r | 75689 | 5.2% |
| d | 61340 | 4.2% |
| o | 52724 | 3.6% |
| h | 47497 | 3.2% |
| s | 40454 | 2.8% |
| Other values (44) | 369454 | 25.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1300773 | 88.8% |
| Uppercase Letter | 162483 | 11.1% |
| Space Separator | 895 | 0.1% |
| Other Punctuation | 13 | < 0.1% |
| Decimal Number | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 228885 | 17.6% |
| n | 195907 | 15.1% |
| i | 190767 | 14.7% |
| e | 105142 | 8.1% |
| l | 96307 | 7.4% |
| r | 75689 | 5.8% |
| d | 61340 | 4.7% |
| o | 52724 | 4.1% |
| h | 47497 | 3.7% |
| s | 40454 | 3.1% |
| Other values (16) | 206061 | 15.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 28152 | 17.3% |
| C | 21480 | 13.2% |
| H | 20672 | 12.7% |
| W | 12315 | 7.6% |
| B | 10522 | 6.5% |
| O | 10358 | 6.4% |
| T | 8937 | 5.5% |
| E | 7395 | 4.6% |
| A | 6493 | 4.0% |
| L | 6455 | 4.0% |
| Other values (14) | 29704 | 18.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 12 | 92.3% |
| , | 1 | 7.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 895 | 100.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 2 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1463256 | 99.9% |
| Common | 910 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 228885 | 15.6% |
| n | 195907 | 13.4% |
| i | 190767 | 13.0% |
| e | 105142 | 7.2% |
| l | 96307 | 6.6% |
| r | 75689 | 5.2% |
| d | 61340 | 4.2% |
| o | 52724 | 3.6% |
| h | 47497 | 3.2% |
| s | 40454 | 2.8% |
| Other values (40) | 368544 | 25.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 895 | 98.4% |
| / | 12 | 1.3% |
| 4 | 2 | 0.2% |
| , | 1 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1464166 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 228885 | 15.6% |
| n | 195907 | 13.4% |
| i | 190767 | 13.0% |
| e | 105142 | 7.2% |
| l | 96307 | 6.6% |
| r | 75689 | 5.2% |
| d | 61340 | 4.2% |
| o | 52724 | 3.6% |
| h | 47497 | 3.2% |
| s | 40454 | 2.8% |
| Other values (44) | 369454 | 25.2% |
latestAgeOrHighestStage
Text
Missing
| Distinct | 35 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 722133 |
| Missing (%) | 99.7% |
| Memory size | 5.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 8.232 |
| Min length | 4 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Givetian |
|---|---|
| 2nd row | Turonian |
| 3rd row | Gelasian |
| 4th row | Gelasian |
| 5th row | Gelasian |
| Value | Count | Frequency (%) |
| lutetian | 829 | 34.9% |
| zanclean | 319 | 13.4% |
| tortonian | 217 | 9.1% |
| gelasian | 200 | 8.4% |
| maastrichtian | 105 | 4.4% |
| late | 98 | 4.1% |
| messinian | 78 | 3.3% |
| thanetian | 78 | 3.3% |
| ypresian | 60 | 2.5% |
| langhian | 58 | 2.4% |
| Other values (25) | 333 | 14.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3358 | 17.2% |
| n | 3107 | 15.9% |
| t | 2287 | 11.7% |
| i | 2268 | 11.6% |
| e | 1838 | 9.4% |
| L | 1015 | 5.2% |
| u | 862 | 4.4% |
| l | 662 | 3.4% |
| o | 553 | 2.8% |
| s | 534 | 2.7% |
| Other values (28) | 3067 | 15.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17176 | 87.9% |
| Uppercase Letter | 2375 | 12.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3358 | 19.6% |
| n | 3107 | 18.1% |
| t | 2287 | 13.3% |
| i | 2268 | 13.2% |
| e | 1838 | 10.7% |
| u | 862 | 5.0% |
| l | 662 | 3.9% |
| o | 553 | 3.2% |
| s | 534 | 3.1% |
| r | 515 | 3.0% |
| Other values (13) | 1192 | 6.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 1015 | 42.7% |
| Z | 319 | 13.4% |
| T | 297 | 12.5% |
| G | 223 | 9.4% |
| M | 196 | 8.3% |
| E | 90 | 3.8% |
| Y | 60 | 2.5% |
| P | 53 | 2.2% |
| C | 50 | 2.1% |
| B | 32 | 1.3% |
| Other values (5) | 40 | 1.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 19551 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3358 | 17.2% |
| n | 3107 | 15.9% |
| t | 2287 | 11.7% |
| i | 2268 | 11.6% |
| e | 1838 | 9.4% |
| L | 1015 | 5.2% |
| u | 862 | 4.4% |
| l | 662 | 3.4% |
| o | 553 | 2.8% |
| s | 534 | 2.7% |
| Other values (28) | 3067 | 15.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19551 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3358 | 17.2% |
| n | 3107 | 15.9% |
| t | 2287 | 11.7% |
| i | 2268 | 11.6% |
| e | 1838 | 9.4% |
| L | 1015 | 5.2% |
| u | 862 | 4.4% |
| l | 662 | 3.4% |
| o | 553 | 2.8% |
| s | 534 | 2.7% |
| Other values (28) | 3067 | 15.7% |
group
Text
Missing 
| Distinct | 557 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 633218 |
| Missing (%) | 87.4% |
| Memory size | 5.5 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 28 |
| Mean length | 14.80891664 |
| Min length | 1 |
Unique
| Unique | 146 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Star Peak Group |
|---|---|
| 2nd row | Chesapeake Group |
| 3rd row | Keokuk Group |
| 4th row | Chesapeake Group |
| 5th row | Chesapeake Group |
| Value | Count | Frequency (%) |
| group | 90331 | 46.7% |
| chesapeake | 38410 | 19.9% |
| river | 7802 | 4.0% |
| white | 5751 | 3.0% |
| selma | 3439 | 1.8% |
| kewanee | 2702 | 1.4% |
| hamilton | 2337 | 1.2% |
| osage | 2256 | 1.2% |
| washita | 1421 | 0.7% |
| pamunkey | 1419 | 0.7% |
| Other values (577) | 37508 | 19.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 166874 | 12.3% |
| p | 131366 | 9.7% |
| a | 118438 | 8.8% |
| r | 115845 | 8.6% |
| o | 113583 | 8.4% |
| (space) | 102086 | 7.6% |
| u | 98547 | 7.3% |
| G | 90741 | 6.7% |
| s | 54633 | 4.0% |
| h | 50628 | 3.7% |
| Other values (47) | 309165 | 22.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1056168 | 78.1% |
| Uppercase Letter | 193474 | 14.3% |
| Space Separator | 102086 | 7.6% |
| Other Punctuation | 124 | < 0.1% |
| Open Punctuation | 21 | < 0.1% |
| Close Punctuation | 21 | < 0.1% |
| Dash Punctuation | 12 | < 0.1% |
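The category labels in this table match the Unicode general categories (Lowercase Letter is `Ll`, Space Separator is `Zs`, and so on). How the profiler computes them internally is not shown here; the following is a minimal sketch using Python's standard `unicodedata` module, with a hypothetical helper `category_counts` and three sample values taken from this variable:

```python
import unicodedata
from collections import Counter

# Readable labels for the general categories that appear in these tables.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Nd": "Decimal Number",
    "Sm": "Math Symbol",
}


def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw two-letter code for unlisted categories.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


sample = ["Star Peak Group", "Chesapeake Group", "Keokuk Group"]
counts = category_counts(sample)
print(counts)
```

The space between words is counted as its own character, which is why Space Separator carries a noticeable share in multi-word variables such as this one.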
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 166874 | 15.8% |
| p | 131366 | 12.4% |
| a | 118438 | 11.2% |
| r | 115845 | 11.0% |
| o | 113583 | 10.8% |
| u | 98547 | 9.3% |
| s | 54633 | 5.2% |
| h | 50628 | 4.8% |
| k | 45139 | 4.3% |
| i | 34291 | 3.2% |
| Other values (16) | 126824 | 12.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 90741 | 46.9% |
| C | 43143 | 22.3% |
| R | 9045 | 4.7% |
| W | 8105 | 4.2% |
| S | 6248 | 3.2% |
| M | 4589 | 2.4% |
| P | 4340 | 2.2% |
| K | 3671 | 1.9% |
| O | 3592 | 1.9% |
| H | 3351 | 1.7% |
| Other values (15) | 16649 | 8.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 88 | 71.0% |
| , | 36 | 29.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 102086 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 21 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 21 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1249642 | 92.4% |
| Common | 102264 | 7.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 166874 | 13.4% |
| p | 131366 | 10.5% |
| a | 118438 | 9.5% |
| r | 115845 | 9.3% |
| o | 113583 | 9.1% |
| u | 98547 | 7.9% |
| G | 90741 | 7.3% |
| s | 54633 | 4.4% |
| h | 50628 | 4.1% |
| k | 45139 | 3.6% |
| Other values (41) | 263848 | 21.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 102086 | 99.8% |
| . | 88 | 0.1% |
| , | 36 | < 0.1% |
| ( | 21 | < 0.1% |
| ) | 21 | < 0.1% |
| - | 12 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1351906 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 166874 | 12.3% |
| p | 131366 | 9.7% |
| a | 118438 | 8.8% |
| r | 115845 | 8.6% |
| o | 113583 | 8.4% |
| (space) | 102086 | 7.6% |
| u | 98547 | 7.3% |
| G | 90741 | 6.7% |
| s | 54633 | 4.0% |
| h | 50628 | 3.7% |
| Other values (47) | 309165 | 22.9% |
formation
Text
Missing 
| Distinct | 5419 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 365706 |
| Missing (%) | 50.5% |
| Memory size | 5.5 MiB |
Length
| Max length | 46 |
|---|---|
| Median length | 38 |
| Mean length | 11.49027319 |
| Min length | 3 |
Unique
| Unique | 1482 |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Prida Fm |
|---|---|
| 2nd row | Yorktown Fm |
| 3rd row | Skinner Ranch Fm |
| 4th row | San Pedro Fm |
| 5th row | Grande Greve Fm |
| Value | Count | Frequency (%) |
| fm | 259134 | 32.0% |
| river | 44301 | 5.5% |
| ls | 39737 | 4.9% |
| stephen | 31376 | 3.9% |
| green | 29207 | 3.6% |
| yorktown | 23754 | 2.9% |
| unknown | 18762 | 2.3% |
| sh | 17735 | 2.2% |
| pungo | 10262 | 1.3% |
| canyon | 8111 | 1.0% |
| Other values (4425) | 326422 | 40.4% |
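The entries in this table are individual lowercased words rather than whole field values: the sample rows read "Prida Fm" and "Yorktown Fm", while the table lists `fm`. A minimal sketch of that kind of count, assuming lowercasing and a plain whitespace split (the profiler's actual tokenizer may also strip punctuation; `word_counts` is a hypothetical helper):

```python
from collections import Counter


def word_counts(values):
    """Count lowercased, whitespace-separated words across all values."""
    counts = Counter()
    for value in values:
        counts.update(value.lower().split())
    return counts


# The five sample rows shown for this variable.
sample = [
    "Prida Fm",
    "Yorktown Fm",
    "Skinner Ranch Fm",
    "San Pedro Fm",
    "Grande Greve Fm",
]
counts = word_counts(sample)
print(counts.most_common(3))
```

Because a value can contain several words, the word total exceeds the number of non-missing rows, so the frequencies are shares of words rather than shares of records.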
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 449999 | 10.9% |
| e | 361227 | 8.8% |
| n | 317355 | 7.7% |
| m | 288475 | 7.0% |
| F | 271104 | 6.6% |
| r | 245377 | 6.0% |
| o | 238913 | 5.8% |
| a | 212844 | 5.2% |
| i | 166070 | 4.0% |
| t | 160119 | 3.9% |
| Other values (56) | 1411250 | 34.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2858690 | 69.3% |
| Uppercase Letter | 809683 | 19.6% |
| Space Separator | 449999 | 10.9% |
| Other Punctuation | 3867 | 0.1% |
| Decimal Number | 156 | < 0.1% |
| Open Punctuation | 135 | < 0.1% |
| Close Punctuation | 134 | < 0.1% |
| Dash Punctuation | 69 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 361227 | 12.6% |
| n | 317355 | 11.1% |
| m | 288475 | 10.1% |
| r | 245377 | 8.6% |
| o | 238913 | 8.4% |
| a | 212844 | 7.4% |
| i | 166070 | 5.8% |
| t | 160119 | 5.6% |
| l | 128749 | 4.5% |
| s | 112733 | 3.9% |
| Other values (16) | 626828 | 21.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 271104 | 33.5% |
| S | 78359 | 9.7% |
| R | 63222 | 7.8% |
| L | 61354 | 7.6% |
| C | 52642 | 6.5% |
| G | 37852 | 4.7% |
| B | 36649 | 4.5% |
| M | 26756 | 3.3% |
| P | 26718 | 3.3% |
| Y | 24537 | 3.0% |
| Other values (15) | 130490 | 16.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2426 | 62.7% |
| , | 703 | 18.2% |
| ? | 651 | 16.8% |
| ' | 64 | 1.7% |
| / | 19 | 0.5% |
| " | 4 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 147 | 94.2% |
| 3 | 3 | 1.9% |
| 9 | 2 | 1.3% |
| 2 | 2 | 1.3% |
| 0 | 2 | 1.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 449999 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 135 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 134 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 69 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3668373 | 89.0% |
| Common | 454360 | 11.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 361227 | 9.8% |
| n | 317355 | 8.7% |
| m | 288475 | 7.9% |
| F | 271104 | 7.4% |
| r | 245377 | 6.7% |
| o | 238913 | 6.5% |
| a | 212844 | 5.8% |
| i | 166070 | 4.5% |
| t | 160119 | 4.4% |
| l | 128749 | 3.5% |
| Other values (41) | 1278140 | 34.8% |
Common
| Value | Count | Frequency (%) |
| (space) | 449999 | 99.0% |
| . | 2426 | 0.5% |
| , | 703 | 0.2% |
| ? | 651 | 0.1% |
| 1 | 147 | < 0.1% |
| ( | 135 | < 0.1% |
| ) | 134 | < 0.1% |
| - | 69 | < 0.1% |
| ' | 64 | < 0.1% |
| / | 19 | < 0.1% |
| Other values (5) | 13 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4122733 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 449999 | 10.9% |
| e | 361227 | 8.8% |
| n | 317355 | 7.7% |
| m | 288475 | 7.0% |
| F | 271104 | 6.6% |
| r | 245377 | 6.0% |
| o | 238913 | 5.8% |
| a | 212844 | 5.2% |
| i | 166070 | 4.0% |
| t | 160119 | 3.9% |
| Other values (56) | 1411250 | 34.2% |
member
Text
Missing 
| Distinct | 1626 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 643191 |
| Missing (%) | 88.8% |
| Memory size | 5.5 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 30 |
| Mean length | 13.99831524 |
| Min length | 1 |
Unique
| Unique | 471 |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | Fossil Hill Mbr |
|---|---|
| 2nd row | Decie Ranch Mbr |
| 3rd row | Millersburg Mbr |
| 4th row | Thin-Bedded Zone Of Udden |
| 5th row | Burgess Sh Mbr |
| Value | Count | Frequency (%) |
| mbr | 79698 | 34.1% |
| sh | 36967 | 15.8% |
| burgess | 30811 | 13.2% |
| ls | 6535 | 2.8% |
| creek | 4230 | 1.8% |
| sunken | 3525 | 1.5% |
| meadow | 3525 | 1.5% |
| ranch | 3361 | 1.4% |
| francis | 2603 | 1.1% |
| b | 2492 | 1.1% |
| Other values (1500) | 60135 | 25.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 152565 | 13.4% |
| r | 138201 | 12.1% |
| M | 87327 | 7.7% |
| s | 86157 | 7.6% |
| b | 84523 | 7.4% |
| e | 79157 | 7.0% |
| h | 47967 | 4.2% |
| S | 46866 | 4.1% |
| u | 42615 | 3.7% |
| a | 41195 | 3.6% |
| Other values (60) | 331728 | 29.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 749978 | 65.9% |
| Uppercase Letter | 232978 | 20.5% |
| Space Separator | 152565 | 13.4% |
| Decimal Number | 2131 | 0.2% |
| Other Punctuation | 324 | < 0.1% |
| Dash Punctuation | 290 | < 0.1% |
| Open Punctuation | 17 | < 0.1% |
| Close Punctuation | 17 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 138201 | 18.4% |
| s | 86157 | 11.5% |
| b | 84523 | 11.3% |
| e | 79157 | 10.6% |
| h | 47967 | 6.4% |
| u | 42615 | 5.7% |
| a | 41195 | 5.5% |
| g | 38517 | 5.1% |
| n | 36464 | 4.9% |
| i | 27554 | 3.7% |
| Other values (16) | 127628 | 17.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 87327 | 37.5% |
| S | 46866 | 20.1% |
| B | 39596 | 17.0% |
| C | 10761 | 4.6% |
| L | 9429 | 4.0% |
| R | 5451 | 2.3% |
| F | 4926 | 2.1% |
| P | 4323 | 1.9% |
| G | 4164 | 1.8% |
| W | 4116 | 1.8% |
| Other values (15) | 16019 | 6.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 858 | 40.3% |
| 2 | 337 | 15.8% |
| 3 | 289 | 13.6% |
| 4 | 247 | 11.6% |
| 5 | 130 | 6.1% |
| 0 | 124 | 5.8% |
| 6 | 102 | 4.8% |
| 7 | 24 | 1.1% |
| 9 | 16 | 0.8% |
| 8 | 4 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 131 | 40.4% |
| . | 128 | 39.5% |
| ? | 64 | 19.8% |
| ' | 1 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 152565 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 290 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 17 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 17 | 100.0% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 982956 | 86.4% |
| Common | 155345 | 13.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 138201 | 14.1% |
| M | 87327 | 8.9% |
| s | 86157 | 8.8% |
| b | 84523 | 8.6% |
| e | 79157 | 8.1% |
| h | 47967 | 4.9% |
| S | 46866 | 4.8% |
| u | 42615 | 4.3% |
| a | 41195 | 4.2% |
| B | 39596 | 4.0% |
| Other values (41) | 289352 | 29.4% |
Common
| Value | Count | Frequency (%) |
| (space) | 152565 | 98.2% |
| 1 | 858 | 0.6% |
| 2 | 337 | 0.2% |
| - | 290 | 0.2% |
| 3 | 289 | 0.2% |
| 4 | 247 | 0.2% |
| , | 131 | 0.1% |
| 5 | 130 | 0.1% |
| . | 128 | 0.1% |
| 0 | 124 | 0.1% |
| Other values (9) | 246 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1138301 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 152565 | 13.4% |
| r | 138201 | 12.1% |
| M | 87327 | 7.7% |
| s | 86157 | 7.6% |
| b | 84523 | 7.4% |
| e | 79157 | 7.0% |
| h | 47967 | 4.2% |
| S | 46866 | 4.1% |
| u | 42615 | 3.7% |
| a | 41195 | 3.6% |
| Other values (60) | 331728 | 29.1% |
typeStatus
Text
Missing 
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 582086 |
| Missing (%) | 80.3% |
| Memory size | 5.5 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 8 |
| Mean length | 7.803239668 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PARATYPE |
|---|---|
| 2nd row | PARATYPE |
| 3rd row | PARATYPE |
| 4th row | TYPE |
| 5th row | HOLOTYPE |
| Value | Count | Frequency (%) |
| paratype | 74612 | 52.4% |
| holotype | 34645 | 24.3% |
| syntype | 19534 | 13.7% |
| type | 7903 | 5.5% |
| paralectotype | 2966 | 2.1% |
| lectotype | 1051 | 0.7% |
| plastoholotype | 593 | 0.4% |
| plastotype | 389 | 0.3% |
| plastoparatype | 282 | 0.2% |
| plastosyntype | 253 | 0.2% |
| Other values (5) | 194 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 221818 | 20.0% |
| Y | 162209 | 14.6% |
| A | 157256 | 14.1% |
| T | 147992 | 13.3% |
| E | 146600 | 13.2% |
| R | 77860 | 7.0% |
| O | 76223 | 6.9% |
| L | 40808 | 3.7% |
| H | 35238 | 3.2% |
| S | 21351 | 1.9% |
| Other values (3) | 23998 | 2.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1111353 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 221818 | 20.0% |
| Y | 162209 | 14.6% |
| A | 157256 | 14.1% |
| T | 147992 | 13.3% |
| E | 146600 | 13.2% |
| R | 77860 | 7.0% |
| O | 76223 | 6.9% |
| L | 40808 | 3.7% |
| H | 35238 | 3.2% |
| S | 21351 | 1.9% |
| Other values (3) | 23998 | 2.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1111353 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 221818 | 20.0% |
| Y | 162209 | 14.6% |
| A | 157256 | 14.1% |
| T | 147992 | 13.3% |
| E | 146600 | 13.2% |
| R | 77860 | 7.0% |
| O | 76223 | 6.9% |
| L | 40808 | 3.7% |
| H | 35238 | 3.2% |
| S | 21351 | 1.9% |
| Other values (3) | 23998 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1111353 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 221818 | 20.0% |
| Y | 162209 | 14.6% |
| A | 157256 | 14.1% |
| T | 147992 | 13.3% |
| E | 146600 | 13.2% |
| R | 77860 | 7.0% |
| O | 76223 | 6.9% |
| L | 40808 | 3.7% |
| H | 35238 | 3.2% |
| S | 21351 | 1.9% |
| Other values (3) | 23998 | 2.2% |
identifiedBy
Text
Missing 
| Distinct | 2463 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 521981 |
| Missing (%) | 72.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 147 |
|---|---|
| Median length | 124 |
| Mean length | 22.47668212 |
| Min length | 2 |
Unique
| Unique | 535 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Silberling; Nichols |
|---|---|
| 2nd row | Vaughan |
| 3rd row | Harper; Boucot |
| 4th row | Said; Barakat, M. G. |
| 5th row | Smith |
| Value | Count | Frequency (%) |
| united | 21468 | 3.2% |
| states | 21082 | 3.2% |
| of | 20281 | 3.1% |
| museum | 15734 | 2.4% |
| helen | 15316 | 2.3% |
| (blank) | 12006 | 1.8% |
| natural | 11887 | 1.8% |
| history | 11620 | 1.8% |
| institution | 11572 | 1.7% |
| smithsonian | 11571 | 1.7% |
| Other values (2466) | 510240 | 77.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 460250 | 10.1% |
| e | 280098 | 6.2% |
| o | 272102 | 6.0% |
| a | 259642 | 5.7% |
| n | 241275 | 5.3% |
| t | 230888 | 5.1% |
| r | 226036 | 5.0% |
| i | 214007 | 4.7% |
| l | 181066 | 4.0% |
| s | 174306 | 3.8% |
| Other values (58) | 2012465 | 44.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2806351 | 61.6% |
| Uppercase Letter | 908175 | 20.0% |
| Space Separator | 460250 | 10.1% |
| Other Punctuation | 280258 | 6.2% |
| Close Punctuation | 40168 | 0.9% |
| Open Punctuation | 40168 | 0.9% |
| Dash Punctuation | 16765 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 280098 | 10.0% |
| o | 272102 | 9.7% |
| a | 259642 | 9.3% |
| n | 241275 | 8.6% |
| t | 230888 | 8.2% |
| r | 226036 | 8.1% |
| i | 214007 | 7.6% |
| l | 181066 | 6.5% |
| s | 174306 | 6.2% |
| u | 121224 | 4.3% |
| Other values (22) | 605707 | 21.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 117932 | 13.0% |
| T | 78022 | 8.6% |
| A | 60143 | 6.6% |
| N | 59104 | 6.5% |
| C | 57622 | 6.3% |
| E | 56100 | 6.2% |
| I | 46266 | 5.1% |
| D | 44046 | 4.8% |
| H | 42705 | 4.7% |
| U | 40270 | 4.4% |
| Other values (16) | 305965 | 33.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 138675 | 49.5% |
| . | 77116 | 27.5% |
| ; | 64257 | 22.9% |
| / | 177 | 0.1% |
| ' | 23 | < 0.1% |
| & | 10 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 460250 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 40168 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 40168 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 16765 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3714526 | 81.6% |
| Common | 837609 | 18.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 280098 | 7.5% |
| o | 272102 | 7.3% |
| a | 259642 | 7.0% |
| n | 241275 | 6.5% |
| t | 230888 | 6.2% |
| r | 226036 | 6.1% |
| i | 214007 | 5.8% |
| l | 181066 | 4.9% |
| s | 174306 | 4.7% |
| u | 121224 | 3.3% |
| Other values (48) | 1513882 |
Common
| Value | Count | Frequency (%) |
| (space) | 460250 | 54.9% |
| , | 138675 | 16.6% |
| . | 77116 | 9.2% |
| ; | 64257 | 7.7% |
| ) | 40168 | 4.8% |
| ( | 40168 | 4.8% |
| - | 16765 | 2.0% |
| / | 177 | < 0.1% |
| ' | 23 | < 0.1% |
| & | 10 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4550350 | |
| None | 1785 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 460250 | 10.1% |
| e | 280098 | 6.2% |
| o | 272102 | 6.0% |
| a | 259642 | 5.7% |
| n | 241275 | 5.3% |
| t | 230888 | 5.1% |
| r | 226036 | 5.0% |
| i | 214007 | 4.7% |
| l | 181066 | 4.0% |
| s | 174306 | 3.8% |
| Other values (52) | 2010680 |
None
| Value | Count | Frequency (%) |
| ñ | 1143 | |
| ý | 251 | 14.1% |
| š | 251 | 14.1% |
| ö | 138 | 7.7% |
| ú | 1 | 0.1% |
| í | 1 | 0.1% |
acceptedNameUsageID
Real number (ℝ)
Missing 
| Distinct | 58335 |
|---|---|
| Distinct (%) | 10.6% |
| Missing | 171789 |
| Missing (%) | 23.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5515085.25 |
| Minimum | 1 |
|---|---|
| Maximum | 12385426 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 216 |
| Q1 | 3249393 |
| median | 4941659 |
| Q3 | 8513230 |
| 95-th percentile | 9626241 |
| Maximum | 12385426 |
| Range | 12385425 |
| Interquartile range (IQR) | 5263837 |
Descriptive statistics
| Standard deviation | 3184869.125 |
|---|---|
| Coefficient of variation (CV) | 0.5774832084 |
| Kurtosis | -0.8885403732 |
| Mean | 5515085.25 |
| Median Absolute Deviation (MAD) | 2688948 |
| Skewness | -0.1769613565 |
| Sum | 3.048292404 × 10¹² |
| Variance | 1.014339134 × 10¹³ |
| Monotonicity | Not monotonic |
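The derived figures in the quantile and descriptive tables for acceptedNameUsageID can be cross-checked from the values the report prints. The following is a minimal sketch, not the profiler's own code: the variable names are illustrative, the 724508 observations come from the dataset overview, and treating Distinct (%) as a share of non-missing rows is an assumption that happens to reproduce the reported 10.6%.

```python
# Cross-check derived statistics using only numbers printed in the report.
# All variable names are illustrative, not part of any library.
n_rows = 724508        # observations (dataset overview)
n_missing = 171789     # missing values for this variable
n_distinct = 58335
mean, std = 5515085.25, 3184869.125
minimum, q1, q3, maximum = 1, 3249393, 8513230, 12385426

n_present = n_rows - n_missing                 # non-missing values
missing_pct = 100 * n_missing / n_rows         # share of all rows
distinct_pct = 100 * n_distinct / n_present    # assumed: share of non-missing rows
value_range = maximum - minimum
iqr = q3 - q1                                  # interquartile range
cv = std / mean                                # coefficient of variation
variance = std ** 2
total = mean * n_present                       # should agree with the reported Sum

print(f"missing {missing_pct:.1f}%, distinct {distinct_pct:.1f}%")
print(f"range {value_range}, IQR {iqr}, CV {cv:.6f}")
print(f"variance {variance:.6e}, sum {total:.6e}")
```

Each result can be compared by eye with the corresponding table row; small differences in the last digits would be expected if the profiler computed the mean in reduced precision.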
| Value | Count | Frequency (%) |
| 216 | 16872 | 2.3% |
| 8513230 | 13693 | 1.9% |
| 4806028 | 12281 | 1.7% |
| 6 | 11457 | 1.6% |
| 359 | 4656 | 0.6% |
| 44 | 4268 | 0.6% |
| 729 | 3674 | 0.5% |
| 353 | 3566 | 0.5% |
| 2481460 | 3232 | 0.4% |
| 4832444 | 3022 | 0.4% |
| Other values (58325) | 475998 | 65.7% |
| (Missing) | 171789 | 23.7% |
| Value | Count | Frequency (%) |
| 1 | 1114 | 0.2% |
| 6 | 11457 | 1.6% |
| 42 | 952 | 0.1% |
| 43 | 51 | < 0.1% |
| 44 | 4268 | 0.6% |
| Value | Count | Frequency (%) |
| 12385426 | 4 | < 0.1% |
| 12385220 | 2 | < 0.1% |
| 12379591 | 6 | < 0.1% |
| 12362277 | 15 | < 0.1% |
| 12358726 | 5 | < 0.1% |
scientificName
Text
| Distinct | 65364 |
|---|---|
| Distinct (%) | 9.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 124 |
|---|---|
| Median length | 82 |
| Mean length | 24.76860849 |
| Min length | 3 |
Unique
| Unique | 24744 |
|---|---|
| Unique (%) | 3.4% |
Sample
| 1st row | incertae sedis |
|---|---|
| 2nd row | Damaliscus lunatus (Burchell, 1823) |
| 3rd row | Acrochordiceras hyatti Meek, 1877 |
| 4th row | Discocyclina sculpturata (Cushman, 1919) |
| 5th row | Odontaspis cuspidata (Agassiz, 1843) |
| Value | Count | Frequency (%) |
| incertae | 171789 | 7.4% |
| sedis | 171789 | 7.4% |
| 80645 | 3.5% | |
| walcott | 31003 | 1.3% |
| cooper | 24261 | 1.1% |
| cushman | 17003 | 0.7% |
| insecta | 16882 | 0.7% |
| 1912 | 16564 | 0.7% |
| grant | 16169 | 0.7% |
| 1976 | 14713 | 0.6% |
| Other values (47365) | 1749493 |
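The word table above appears to list lowercase tokens with surrounding punctuation removed (for example `walcott` and `1912` from author citations). A minimal sketch of that kind of count follows, assuming whitespace splitting and punctuation stripping; the function name `word_counts` is hypothetical and the profiler's actual tokenizer may differ, for instance in how it treats tokens that consist only of punctuation.

```python
import string
from collections import Counter

def word_counts(values):
    """Count lowercase, whitespace-delimited tokens, ignoring edge punctuation."""
    counts = Counter()
    for value in values:
        for token in value.lower().split():
            token = token.strip(string.punctuation)
            if token:  # assumption: drop tokens that were punctuation only
                counts[token] += 1
    return counts

# Sample rows copied from the scientificName section of the report.
sample = [
    "incertae sedis",
    "Damaliscus lunatus (Burchell, 1823)",
    "Acrochordiceras hyatti Meek, 1877",
]
counts = word_counts(sample)
print(counts.most_common(3))
```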
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1585803 | 8.8% |
| e | 1485632 | 8.3% |
| a | 1415466 | 7.9% |
| i | 1243670 | 6.9% |
| s | 1115918 | 6.2% |
| r | 978896 | 5.5% |
| n | 888307 | 5.0% |
| o | 817782 | 4.6% |
| t | 774995 | 4.3% |
| l | 698421 | 3.9% |
| Other values (99) | 6940165 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12560571 | 70.0% |
| Decimal Number | 1825392 | 10.2% |
| Space Separator | 1585803 | 8.8% |
| Uppercase Letter | 1178126 | 6.6% |
| Other Punctuation | 581846 | 3.2% |
| Close Punctuation | 105296 | 0.6% |
| Open Punctuation | 105296 | 0.6% |
| Dash Punctuation | 2722 | < 0.1% |
| Math Symbol | 3 | < 0.1% |
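The category names in this table correspond to Unicode general categories (Ll, Lu, Nd, Zs, Po, Ps, Pe, Pd, Sm). As a rough illustration, and not the profiler's own implementation, the same tally can be produced with Python's standard `unicodedata` module; the helper name `category_counts` is hypothetical.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Tally Unicode general categories over every character of every value."""
    counts = Counter()
    for value in values:
        counts.update(unicodedata.category(ch) for ch in value)
    return counts

# One sample row from the scientificName section of the report.
cats = category_counts(["Damaliscus lunatus (Burchell, 1823)"])
for code, n in cats.most_common():
    print(code, n)

# The multiplication sign used in hybrid names falls under Math Symbol (Sm).
print(unicodedata.category("×"))
```

The two-letter codes map onto the report's labels as follows: Ll is Lowercase Letter, Lu is Uppercase Letter, Nd is Decimal Number, Zs is Space Separator, Po is Other Punctuation, Ps and Pe are Open and Close Punctuation, Pd is Dash Punctuation.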
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1485632 | |
| a | 1415466 | |
| i | 1243670 | |
| s | 1115918 | |
| r | 978896 | 7.8% |
| n | 888307 | 7.1% |
| o | 817782 | 6.5% |
| t | 774995 | 6.2% |
| l | 698421 | 5.6% |
| c | 591840 | 4.7% |
| Other values (47) | 2549644 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 146701 | |
| P | 100181 | 8.5% |
| S | 99223 | 8.4% |
| B | 93794 | 8.0% |
| M | 79260 | 6.7% |
| G | 78038 | 6.6% |
| W | 66342 | 5.6% |
| A | 64563 | 5.5% |
| L | 63670 | 5.4% |
| H | 59251 | 5.0% |
| Other values (22) | 327103 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 538747 | |
| 9 | 311438 | |
| 8 | 283152 | |
| 7 | 137355 | 7.5% |
| 6 | 113287 | 6.2% |
| 5 | 102651 | 5.6% |
| 2 | 99760 | 5.5% |
| 3 | 91117 | 5.0% |
| 4 | 74403 | 4.1% |
| 0 | 73482 | 4.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 460573 | |
| & | 80645 | 13.9% |
| . | 33035 | 5.7% |
| ' | 7591 | 1.3% |
| ? | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1585803 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 105296 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 105296 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2722 |
Math Symbol
| Value | Count | Frequency (%) |
| × | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13738697 | 76.6% |
| Common | 4206358 | 23.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1485632 | 10.8% |
| a | 1415466 | 10.3% |
| i | 1243670 | 9.1% |
| s | 1115918 | 8.1% |
| r | 978896 | 7.1% |
| n | 888307 | 6.5% |
| o | 817782 | 6.0% |
| t | 774995 | 5.6% |
| l | 698421 | 5.1% |
| c | 591840 | 4.3% |
| Other values (79) | 3727770 |
Common
| Value | Count | Frequency (%) |
| (space) | 1585803 | 37.7% |
| 1 | 538747 | 12.8% |
| , | 460573 | 10.9% |
| 9 | 311438 | 7.4% |
| 8 | 283152 | 6.7% |
| 7 | 137355 | 3.3% |
| 6 | 113287 | 2.7% |
| ) | 105296 | 2.5% |
| ( | 105296 | 2.5% |
| 5 | 102651 | 2.4% |
| Other values (10) | 462760 | 11.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17931313 | |
| None | 13742 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1585803 | 8.8% |
| e | 1485632 | 8.3% |
| a | 1415466 | 7.9% |
| i | 1243670 | 6.9% |
| s | 1115918 | 6.2% |
| r | 978896 | 5.5% |
| n | 888307 | 5.0% |
| o | 817782 | 4.6% |
| t | 774995 | 4.3% |
| l | 698421 | 3.9% |
| Other values (61) | 6926423 |
None
| Value | Count | Frequency (%) |
| ü | 3637 | |
| ö | 2722 | |
| è | 2108 | |
| é | 2051 | |
| ú | 1773 | |
| ã | 292 | 2.1% |
| ë | 259 | 1.9% |
| ž | 160 | 1.2% |
| ä | 153 | 1.1% |
| å | 121 | 0.9% |
| Other values (28) | 466 | 3.4% |
Missing 
| Distinct | 3844 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 172643 |
| Missing (%) | 23.8% |
| Memory size | 5.5 MiB |
Length
| Max length | 141 |
|---|---|
| Median length | 123 |
| Mean length | 59.08444638 |
| Min length | 5 |
Unique
| Unique | 743 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Animalia, Chordata, Vertebrata, Mammalia, Eutheria, Laurasiatheria, Artiodactyla, Ruminatia, Bovidae |
|---|---|
| 2nd row | Animalia, Mollusca, Cephalopoda, Ammonoidea |
| 3rd row | Chromista, Foraminifera, Globothalamea, Rotaliida, Discocyclinidae |
| 4th row | Animalia, Chordata, Vertebrata, Pisces, Chondrichthyes, Elasmobranchii, Galeomorphii, Lamniformes, Odontaspididae |
| 5th row | Animalia, Brachiopoda, Rhynchonellata, Orthida, Enteletidae |
| Value | Count | Frequency (%) |
| animalia | 448323 | 15.7% |
| chordata | 148700 | 5.2% |
| vertebrata | 148618 | 5.2% |
| arthropoda | 100318 | 3.5% |
| mollusca | 69025 | 2.4% |
| brachiopoda | 66748 | 2.3% |
| foraminifera | 66301 | 2.3% |
| chromista | 65999 | 2.3% |
| mammalia | 60027 | 2.1% |
| eutheria | 57586 | 2.0% |
| Other values (3834) | 1620986 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4706865 | |
| i | 3184420 | 9.8% |
| (space) | 2300766 | 7.1% |
| , | 2260526 | 6.9% |
| o | 2052009 | 6.3% |
| r | 2005114 | 6.1% |
| e | 1809015 | 5.5% |
| t | 1671086 | 5.1% |
| l | 1501858 | 4.6% |
| n | 1400746 | 4.3% |
| Other values (51) | 9714233 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 25197474 | 77.3% |
| Uppercase Letter | 2811914 | 8.6% |
| Space Separator | 2300766 | 7.1% |
| Other Punctuation | 2295928 | 7.0% |
| Decimal Number | 471 | < 0.1% |
| Open Punctuation | 42 | < 0.1% |
| Close Punctuation | 42 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4706865 | |
| i | 3184420 | |
| o | 2052009 | |
| r | 2005114 | |
| e | 1809015 | 7.2% |
| t | 1671086 | 6.6% |
| l | 1501858 | 6.0% |
| n | 1400746 | 5.6% |
| d | 1257138 | 5.0% |
| m | 1113235 | 4.4% |
| Other values (16) | 4495988 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 662527 | |
| C | 427513 | |
| P | 199516 | 7.1% |
| M | 161377 | 5.7% |
| V | 161299 | 5.7% |
| S | 144831 | 5.2% |
| E | 143204 | 5.1% |
| R | 141162 | 5.0% |
| B | 123534 | 4.4% |
| G | 116236 | 4.1% |
| Other values (16) | 530715 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2260526 | |
| . | 35391 | 1.5% |
| " | 8 | < 0.1% |
| ? | 3 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2300766 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 471 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 42 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 42 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 28009388 | 85.9% |
| Common | 4597250 | 14.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4706865 | |
| i | 3184420 | |
| o | 2052009 | 7.3% |
| r | 2005114 | 7.2% |
| e | 1809015 | 6.5% |
| t | 1671086 | 6.0% |
| l | 1501858 | 5.4% |
| n | 1400746 | 5.0% |
| d | 1257138 | 4.5% |
| m | 1113235 | 4.0% |
| Other values (42) | 7307902 |
Common
| Value | Count | Frequency (%) |
| (space) | 2300766 | 50.0% |
| , | 2260526 | 49.2% |
| . | 35391 | 0.8% |
| 0 | 471 | < 0.1% |
| ( | 42 | < 0.1% |
| ) | 42 | < 0.1% |
| " | 8 | < 0.1% |
| ? | 3 | < 0.1% |
| - | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32606638 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4706865 | |
| i | 3184420 | 9.8% |
| (space) | 2300766 | 7.1% |
| , | 2260526 | 6.9% |
| o | 2052009 | 6.3% |
| r | 2005114 | 6.1% |
| e | 1809015 | 5.5% |
| t | 1671086 | 5.1% |
| l | 1501858 | 4.6% |
| n | 1400746 | 4.3% |
| Other values (51) | 9714233 |
kingdom
Text
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 8 |
| Mean length | 9.46887543 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
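Distinct and Unique measure different things: Distinct is the number of different values, while Unique is the number of values that occur exactly once. That is how kingdom can report 7 distinct values and 0 unique ones. A small sketch of the two counts, with a hypothetical helper name and a five-row sample taken from this section:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(v for v in values if v is not None)  # ignore missing entries
    n_distinct = len(counts)
    n_unique = sum(1 for c in counts.values() if c == 1)
    return n_distinct, n_unique

# The five sample rows shown for kingdom.
sample = ["incertae sedis", "Animalia", "Animalia", "Chromista", "Animalia"]
n_distinct, n_unique = distinct_and_unique(sample)
print(n_distinct, n_unique)
```

On the full column every kingdom value repeats many times, so the second count drops to zero even though the first stays at 7.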
Sample
| 1st row | incertae sedis |
|---|---|
| 2nd row | Animalia |
| 3rd row | Animalia |
| 4th row | Chromista |
| 5th row | Animalia |
| Value | Count | Frequency (%) |
| animalia | 446288 | 49.8% |
| incertae | 171929 | 19.2% |
| sedis | 171929 | 19.2% |
| chromista | 69124 | 7.7% |
| plantae | 36324 | 4.1% |
| bacteria | 502 | 0.1% |
| protozoa | 287 | < 0.1% |
| fungi | 54 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 1306114 | |
| a | 1207568 | |
| n | 654595 | |
| e | 552613 | |
| m | 515412 | 7.5% |
| l | 482612 | 7.0% |
| A | 446288 | 6.5% |
| s | 412982 | 6.0% |
| t | 278166 | 4.1% |
| r | 241842 | 3.5% |
| Other values (12) | 762084 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6135768 | 89.4% |
| Uppercase Letter | 552579 | 8.1% |
| Space Separator | 171929 | 2.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1306114 | |
| a | 1207568 | |
| n | 654595 | |
| e | 552613 | |
| m | 515412 | 8.4% |
| l | 482612 | 7.9% |
| s | 412982 | 6.7% |
| t | 278166 | 4.5% |
| r | 241842 | 3.9% |
| c | 172431 | 2.8% |
| Other values (6) | 311433 | 5.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 446288 | |
| C | 69124 | 12.5% |
| P | 36611 | 6.6% |
| B | 502 | 0.1% |
| F | 54 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 171929 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6688347 | 97.5% |
| Common | 171929 | 2.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 1306114 | |
| a | 1207568 | |
| n | 654595 | |
| e | 552613 | |
| m | 515412 | 7.7% |
| l | 482612 | 7.2% |
| A | 446288 | 6.7% |
| s | 412982 | 6.2% |
| t | 278166 | 4.2% |
| r | 241842 | 3.6% |
| Other values (11) | 590155 |
Common
| Value | Count | Frequency (%) |
| (space) | 171929 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6860276 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 1306114 | |
| a | 1207568 | |
| n | 654595 | |
| e | 552613 | |
| m | 515412 | 7.5% |
| l | 482612 | 7.0% |
| A | 446288 | 6.5% |
| s | 412982 | 6.0% |
| t | 278166 | 4.1% |
| r | 241842 | 3.5% |
| Other values (12) | 762084 |
phylum
Text
Missing 
| Distinct | 40 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 192842 |
| Missing (%) | 26.6% |
| Memory size | 5.5 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 16 |
| Mean length | 9.682191451 |
| Min length | 7 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Chordata |
|---|---|
| 2nd row | Mollusca |
| 3rd row | Foraminifera |
| 4th row | Chordata |
| 5th row | Brachiopoda |
| Value | Count | Frequency (%) |
| chordata | 148527 | |
| arthropoda | 101505 | |
| mollusca | 66708 | |
| foraminifera | 66099 | |
| brachiopoda | 65633 | |
| echinodermata | 27100 | 5.1% |
| tracheophyta | 21340 | 4.0% |
| bryozoa | 13677 | 2.6% |
| cnidaria | 6914 | 1.3% |
| annelida | 3027 | 0.6% |
| Other values (30) | 11136 | 2.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 872631 | |
| o | 702054 | |
| r | 631416 | |
| h | 394973 | 7.7% |
| d | 356870 | 6.9% |
| t | 305449 | 5.9% |
| i | 250365 | 4.9% |
| p | 194047 | 3.8% |
| c | 186323 | 3.6% |
| C | 156569 | 3.0% |
| Other values (25) | 1096995 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4616026 | |
| Uppercase Letter | 531666 | 10.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 872631 | |
| o | 702054 | |
| r | 631416 | |
| h | 394973 | |
| d | 356870 | |
| t | 305449 | 6.6% |
| i | 250365 | 5.4% |
| p | 194047 | 4.2% |
| c | 186323 | 4.0% |
| l | 138356 | 3.0% |
| Other values (10) | 583542 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 156569 | |
| A | 104600 | |
| B | 79330 | |
| M | 66964 | |
| F | 66099 | |
| E | 27115 | 5.1% |
| T | 21386 | 4.0% |
| P | 4292 | 0.8% |
| O | 2549 | 0.5% |
| H | 2348 | 0.4% |
| Other values (5) | 414 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5147692 |
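The phylum figures are internally consistent, which gives a quick way to sanity-check the length statistics: the character total divided by the number of non-missing values should reproduce the reported mean length. The sketch below uses only numbers printed in this report; the variable names are illustrative, and the reading that each phylum value is a single capitalized word (so uppercase letters equal non-missing rows) is an inference from the counts, not something the report states.

```python
# Sanity-check the phylum length statistics from reported totals.
n_rows = 724508            # observations (dataset overview)
n_missing = 192842         # missing phylum values
total_chars = 5147692      # all characters in phylum (Latin script total)
uppercase_chars = 531666   # Uppercase Letter count for phylum

n_present = n_rows - n_missing
mean_length = total_chars / n_present

# One uppercase letter per non-missing value is consistent with
# single-word, capitalized phylum names.
one_capital_per_value = uppercase_chars == n_present

print(n_present, round(mean_length, 9), one_capital_per_value)
```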
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 872631 | |
| o | 702054 | |
| r | 631416 | |
| h | 394973 | 7.7% |
| d | 356870 | 6.9% |
| t | 305449 | 5.9% |
| i | 250365 | 4.9% |
| p | 194047 | 3.8% |
| c | 186323 | 3.6% |
| C | 156569 | 3.0% |
| Other values (25) | 1096995 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5147692 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 872631 | |
| o | 702054 | |
| r | 631416 | |
| h | 394973 | 7.7% |
| d | 356870 | 6.9% |
| t | 305449 | 5.9% |
| i | 250365 | 4.9% |
| p | 194047 | 3.8% |
| c | 186323 | 3.6% |
| C | 156569 | 3.0% |
| Other values (25) | 1096995 |
class
Text
Missing 
| Distinct | 92 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 272566 |
| Missing (%) | 37.6% |
| Memory size | 5.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 15 |
| Mean length | 9.989064969 |
| Min length | 4 |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Mammalia |
|---|---|
| 2nd row | Cephalopoda |
| 3rd row | Globothalamea |
| 4th row | Elasmobranchii |
| 5th row | Rhynchonellata |
| Value | Count | Frequency (%) |
| mammalia | 59795 | |
| globothalamea | 42882 | 9.5% |
| rhynchonellata | 39551 | 8.8% |
| aves | 34584 | 7.7% |
| insecta | 32733 | 7.2% |
| gastropoda | 24245 | 5.4% |
| ostracoda | 23481 | 5.2% |
| elasmobranchii | 23303 | 5.2% |
| trilobita | 22315 | 4.9% |
| bivalvia | 22257 | 4.9% |
| Other values (82) | 126796 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 873045 | |
| o | 420982 | 9.3% |
| l | 396601 | 8.8% |
| i | 316058 | 7.0% |
| t | 241147 | 5.3% |
| e | 235006 | 5.2% |
| m | 212254 | 4.7% |
| n | 206140 | 4.6% |
| h | 195566 | 4.3% |
| s | 167736 | 3.7% |
| Other values (32) | 1249943 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4062536 | |
| Uppercase Letter | 451942 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 873045 | |
| o | 420982 | |
| l | 396601 | |
| i | 316058 | 7.8% |
| t | 241147 | 5.9% |
| e | 235006 | 5.8% |
| m | 212254 | 5.2% |
| n | 206140 | 5.1% |
| h | 195566 | 4.8% |
| s | 167736 | 4.1% |
| Other values (13) | 798001 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 92592 | |
| G | 71598 | |
| A | 42619 | |
| R | 40038 | |
| I | 32733 | 7.2% |
| E | 32684 | 7.2% |
| C | 30885 | 6.8% |
| T | 29606 | 6.6% |
| B | 25001 | 5.5% |
| O | 23622 | 5.2% |
| Other values (9) | 30564 | 6.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4514478 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 873045 | |
| o | 420982 | 9.3% |
| l | 396601 | 8.8% |
| i | 316058 | 7.0% |
| t | 241147 | 5.3% |
| e | 235006 | 5.2% |
| m | 212254 | 4.7% |
| n | 206140 | 4.6% |
| h | 195566 | 4.3% |
| s | 167736 | 3.7% |
| Other values (32) | 1249943 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4514478 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 873045 | |
| o | 420982 | 9.3% |
| l | 396601 | 8.8% |
| i | 316058 | 7.0% |
| t | 241147 | 5.3% |
| e | 235006 | 5.2% |
| m | 212254 | 4.7% |
| n | 206140 | 4.6% |
| h | 195566 | 4.3% |
| s | 167736 | 3.7% |
| Other values (32) | 1249943 |
order
Text
Missing 
| Distinct | 484 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 369296 |
| Missing (%) | 51.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 17 |
| Mean length | 11.06623369 |
| Min length | 5 |
Unique
| Unique | 41 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Artiodactyla |
|---|---|
| 2nd row | Ceratitida |
| 3rd row | Rotaliida |
| 4th row | Lamniformes |
| 5th row | Procellariiformes |
| Value | Count | Frequency (%) |
| rotaliida | 32460 | 9.1% |
| diptera | 14185 | 4.0% |
| porocephalida | 14086 | 4.0% |
| podocopida | 12424 | 3.5% |
| lamniformes | 11376 | 3.2% |
| cetacea | 10382 | 2.9% |
| procellariiformes | 9895 | 2.8% |
| artiodactyla | 8981 | 2.5% |
| terebratulida | 8715 | 2.5% |
| perissodactyla | 7870 | 2.2% |
| Other values (474) | 224838 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 543173 | |
| i | 471633 | |
| o | 372619 | 9.5% |
| r | 281531 | 7.2% |
| e | 276408 | 7.0% |
| d | 260645 | 6.6% |
| t | 210238 | 5.3% |
| l | 209185 | 5.3% |
| s | 166314 | 4.2% |
| c | 134203 | 3.4% |
| Other values (40) | 1004910 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3575647 | |
| Uppercase Letter | 355212 | 9.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 543173 | |
| i | 471633 | |
| o | 372619 | |
| r | 281531 | |
| e | 276408 | |
| d | 260645 | |
| t | 210238 | 5.9% |
| l | 209185 | 5.9% |
| s | 166314 | 4.7% |
| c | 134203 | 3.8% |
| Other values (16) | 649698 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 86599 | |
| C | 54230 | |
| R | 49477 | |
| L | 31460 | 8.9% |
| A | 26066 | 7.3% |
| T | 20328 | 5.7% |
| D | 18011 | 5.1% |
| N | 13824 | 3.9% |
| M | 13070 | 3.7% |
| S | 11063 | 3.1% |
| Other values (14) | 31084 | 8.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3930859 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 543173 | |
| i | 471633 | |
| o | 372619 | 9.5% |
| r | 281531 | 7.2% |
| e | 276408 | 7.0% |
| d | 260645 | 6.6% |
| t | 210238 | 5.3% |
| l | 209185 | 5.3% |
| s | 166314 | 4.2% |
| c | 134203 | 3.4% |
| Other values (40) | 1004910 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3930859 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 543173 | |
| i | 471633 | |
| o | 372619 | 9.5% |
| r | 281531 | 7.2% |
| e | 276408 | 7.0% |
| d | 260645 | 6.6% |
| t | 210238 | 5.3% |
| l | 209185 | 5.3% |
| s | 166314 | 4.2% |
| c | 134203 | 3.4% |
| Other values (40) | 1004910 |
family
Text
Missing 
| Distinct | 4830 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 258765 |
| Missing (%) | 35.7% |
| Memory size | 5.5 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 21 |
| Mean length | 12.53716749 |
| Min length | 5 |
Unique
| Unique | 637 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Bovidae |
|---|---|
| 2nd row | Acrochordiceratidae |
| 3rd row | Orbitoclypeidae |
| 4th row | Odontaspididae |
| 5th row | Enteletidae |
| Value | Count | Frequency (%) |
| subtriquetridae | 14086 | 3.0% |
| milichiidae | 13693 | 2.9% |
| procellariidae | 9409 | 2.0% |
| lamnidae | 7013 | 1.5% |
| carcharhinidae | 5646 | 1.2% |
| anatidae | 5251 | 1.1% |
| phocidae | 4763 | 1.0% |
| vaginulinidae | 3864 | 0.8% |
| equidae | 3840 | 0.8% |
| physeteridae | 3794 | 0.8% |
| Other values (4820) | 394384 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 849890 | |
| e | 786222 | |
| a | 738375 | |
| d | 534378 | |
| r | 330492 | 5.7% |
| o | 326737 | 5.6% |
| l | 281417 | 4.8% |
| t | 274019 | 4.7% |
| n | 228653 | 3.9% |
| c | 216586 | 3.7% |
| Other values (42) | 1272329 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5373355 | |
| Uppercase Letter | 465743 | 8.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 849890 | |
| e | 786222 | |
| a | 738375 | |
| d | 534378 | |
| r | 330492 | 6.2% |
| o | 326737 | 6.1% |
| l | 281417 | 5.2% |
| t | 274019 | 5.1% |
| n | 228653 | 4.3% |
| c | 216586 | 4.0% |
| Other values (16) | 806586 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 63107 | |
| C | 50262 | |
| S | 46268 | |
| M | 38138 | 8.2% |
| A | 35135 | 7.5% |
| L | 27292 | 5.9% |
| T | 27199 | 5.8% |
| H | 23746 | 5.1% |
| E | 21742 | 4.7% |
| B | 20654 | 4.4% |
| Other values (16) | 112200 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5839098 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 849890 | |
| e | 786222 | |
| a | 738375 | |
| d | 534378 | |
| r | 330492 | 5.7% |
| o | 326737 | 5.6% |
| l | 281417 | 4.8% |
| t | 274019 | 4.7% |
| n | 228653 | 3.9% |
| c | 216586 | 3.7% |
| Other values (42) | 1272329 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5839098 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 849890 | |
| e | 786222 | |
| a | 738375 | |
| d | 534378 | |
| r | 330492 | 5.7% |
| o | 326737 | 5.6% |
| l | 281417 | 4.8% |
| t | 274019 | 4.7% |
| n | 228653 | 3.9% |
| c | 216586 | 3.7% |
| Other values (42) | 1272329 |
genus
Text
Missing 
| Distinct | 20048 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 245070 |
| Missing (%) | 33.8% |
| Memory size | 5.5 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 20 |
| Mean length | 10.1276432 |
| Min length | 3 |
Unique
| Unique | 4473 |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | Damaliscus |
|---|---|
| 2nd row | Acrochordiceras |
| 3rd row | Asterocyclina |
| 4th row | Carcharias |
| 5th row | Enteletes |
| Value | Count | Frequency (%) |
| genus | 13850 | 2.9% |
| marrella | 12281 | 2.6% |
| pterodroma | 6789 | 1.4% |
| callophoca | 3770 | 0.8% |
| physeterula | 3029 | 0.6% |
| carcharhinus | 2974 | 0.6% |
| australca | 2250 | 0.5% |
| thambetochen | 2208 | 0.5% |
| hustedia | 2080 | 0.4% |
| branta | 2051 | 0.4% |
| Other values (20038) | 428156 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 524027 | 10.8% |
| i | 409959 | 8.4% |
| o | 399283 | 8.2% |
| e | 377679 | 7.8% |
| r | 355924 | 7.3% |
| s | 324654 | 6.7% |
| l | 308448 | 6.4% |
| n | 254099 | 5.2% |
| t | 240655 | 5.0% |
| u | 219806 | 4.5% |
| Other values (43) | 1441043 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4376122 | |
| Uppercase Letter | 479438 | 9.9% |
| Dash Punctuation | 17 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 524027 | |
| i | 409959 | |
| o | 399283 | |
| e | 377679 | 8.6% |
| r | 355924 | 8.1% |
| s | 324654 | 7.4% |
| l | 308448 | 7.0% |
| n | 254099 | 5.8% |
| t | 240655 | 5.5% |
| u | 219806 | 5.0% |
| Other values (16) | 961588 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 67119 | |
| C | 57423 | |
| M | 39323 | 8.2% |
| A | 37322 | 7.8% |
| S | 33234 | 6.9% |
| G | 31949 | 6.7% |
| H | 25258 | 5.3% |
| T | 25070 | 5.2% |
| B | 24416 | 5.1% |
| L | 23028 | 4.8% |
| Other values (16) | 115296 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 17 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4855560 | |
| Common | 17 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 524027 | 10.8% |
| i | 409959 | 8.4% |
| o | 399283 | 8.2% |
| e | 377679 | 7.8% |
| r | 355924 | 7.3% |
| s | 324654 | 6.7% |
| l | 308448 | 6.4% |
| n | 254099 | 5.2% |
| t | 240655 | 5.0% |
| u | 219806 | 4.5% |
| Other values (42) | 1441026 |
Common
| Value | Count | Frequency (%) |
| - | 17 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4855577 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 524027 | 10.8% |
| i | 409959 | 8.4% |
| o | 399283 | 8.2% |
| e | 377679 | 7.8% |
| r | 355924 | 7.3% |
| s | 324654 | 6.7% |
| l | 308448 | 6.4% |
| n | 254099 | 5.2% |
| t | 240655 | 5.0% |
| u | 219806 | 4.5% |
| Other values (43) | 1441043 |
genericName
Text
Missing 
| Distinct | 19254 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 244897 |
| Missing (%) | 33.8% |
| Memory size | 5.5 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 19 |
| Mean length | 10.00970995 |
| Min length | 3 |
Unique
| Unique | 4453 |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | Damaliscus |
|---|---|
| 2nd row | Acrochordiceras |
| 3rd row | Discocyclina |
| 4th row | Odontaspis |
| 5th row | Enteletes |
| Value | Count | Frequency (%) |
| genus | 13850 | 2.9% |
| marrella | 12281 | 2.6% |
| pterodroma | 7305 | 1.5% |
| callophoca | 3770 | 0.8% |
| isurus | 3463 | 0.7% |
| physeterula | 3029 | 0.6% |
| carcharhinus | 2930 | 0.6% |
| australca | 2250 | 0.5% |
| thambetochen | 2208 | 0.5% |
| hustedia | 2082 | 0.4% |
| Other values (19244) | 426443 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 519031 | 10.8% |
| i | 403892 | 8.4% |
| o | 387257 | 8.1% |
| e | 374510 | 7.8% |
| r | 356780 | 7.4% |
| s | 320239 | 6.7% |
| l | 307792 | 6.4% |
| n | 251588 | 5.2% |
| t | 236485 | 4.9% |
| u | 219057 | 4.6% |
| Other values (45) | 1424136 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4321139 | |
| Uppercase Letter | 479611 | 10.0% |
| Dash Punctuation | 17 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 519031 | |
| i | 403892 | |
| o | 387257 | |
| e | 374510 | 8.7% |
| r | 356780 | 8.3% |
| s | 320239 | 7.4% |
| l | 307792 | 7.1% |
| n | 251588 | 5.8% |
| t | 236485 | 5.5% |
| u | 219057 | 5.1% |
| Other values (18) | 944508 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 64637 | |
| C | 57713 | |
| M | 37998 | 7.9% |
| A | 37686 | 7.9% |
| G | 33907 | 7.1% |
| S | 33774 | 7.0% |
| H | 25624 | 5.3% |
| T | 25159 | 5.2% |
| B | 24452 | 5.1% |
| L | 22457 | 4.7% |
| Other values (16) | 116204 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 17 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4800750 | |
| Common | 17 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 519031 | 10.8% |
| i | 403892 | 8.4% |
| o | 387257 | 8.1% |
| e | 374510 | 7.8% |
| r | 356780 | 7.4% |
| s | 320239 | 6.7% |
| l | 307792 | 6.4% |
| n | 251588 | 5.2% |
| t | 236485 | 4.9% |
| u | 219057 | 4.6% |
| Other values (44) | 1424119 |
Common
| Value | Count | Frequency (%) |
| - | 17 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4800593 | |
| None | 174 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 519031 | 10.8% |
| i | 403892 | 8.4% |
| o | 387257 | 8.1% |
| e | 374510 | 7.8% |
| r | 356780 | 7.4% |
| s | 320239 | 6.7% |
| l | 307792 | 6.4% |
| n | 251588 | 5.2% |
| t | 236485 | 4.9% |
| u | 219057 | 4.6% |
| Other values (43) | 1423962 |
None
| Value | Count | Frequency (%) |
| ë | 164 | |
| ö | 10 | 5.7% |
specificEpithet
Text
Missing 
| Distinct | 21987 |
|---|---|
| Distinct (%) | 8.0% |
| Missing | 449718 |
| Missing (%) | 62.1% |
| Memory size | 5.5 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 17 |
| Mean length | 8.738418429 |
| Min length | 2 |
Unique
| Unique | 6275 |
|---|---|
| Unique (%) | 2.3% |
Sample
| 1st row | lunatus |
|---|---|
| 2nd row | hyatti |
| 3rd row | sculpturata |
| 4th row | cuspidata |
| 5th row | rotundobesus |
| Value | Count | Frequency (%) |
| phaeopygia | 3232 | 1.2% |
| alba | 2027 | 0.7% |
| megalodon | 1648 | 0.6% |
| confluens | 1438 | 0.5% |
| obscura | 1243 | 0.5% |
| cahow | 1050 | 0.4% |
| hastalis | 917 | 0.3% |
| socialis | 884 | 0.3% |
| varians | 883 | 0.3% |
| paulus | 879 | 0.3% |
| Other values (21977) | 260589 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 304358 | |
| i | 265754 | |
| s | 232914 | |
| e | 187242 | 7.8% |
| n | 166271 | 6.9% |
| r | 155825 | 6.5% |
| u | 136921 | 5.7% |
| o | 136782 | 5.7% |
| l | 131891 | 5.5% |
| t | 131533 | 5.5% |
| Other values (19) | 551739 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2401217 | |
| Dash Punctuation | 13 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 304358 | |
| i | 265754 | |
| s | 232914 | |
| e | 187242 | 7.8% |
| n | 166271 | 6.9% |
| r | 155825 | 6.5% |
| u | 136921 | 5.7% |
| o | 136782 | 5.7% |
| l | 131891 | 5.5% |
| t | 131533 | 5.5% |
| Other values (18) | 551726 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 13 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2401217 | |
| Common | 13 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 304358 | |
| i | 265754 | |
| s | 232914 | |
| e | 187242 | 7.8% |
| n | 166271 | 6.9% |
| r | 155825 | 6.5% |
| u | 136921 | 5.7% |
| o | 136782 | 5.7% |
| l | 131891 | 5.5% |
| t | 131533 | 5.5% |
| Other values (18) | 551726 |
Common
| Value | Count | Frequency (%) |
| - | 13 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2401228 | |
| None | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 304358 | |
| i | 265754 | |
| s | 232914 | |
| e | 187242 | 7.8% |
| n | 166271 | 6.9% |
| r | 155825 | 6.5% |
| u | 136921 | 5.7% |
| o | 136782 | 5.7% |
| l | 131891 | 5.5% |
| t | 131533 | 5.5% |
| Other values (17) | 551737 |
None
| Value | Count | Frequency (%) |
| ü | 1 | |
| ö | 1 |
infraspecificEpithet
Text
Missing 
| Distinct | 1469 |
|---|---|
| Distinct (%) | 23.3% |
| Missing | 718207 |
| Missing (%) | 99.1% |
| Memory size | 5.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 15 |
| Mean length | 9.022536105 |
| Min length | 2 |
Unique
| Unique | 540 |
|---|---|
| Unique (%) | 8.6% |
Sample
| 1st row | cooperensis |
|---|---|
| 2nd row | subdecorata |
| 3rd row | advena |
| 4th row | convexa |
| 5th row | poloumera |
| Value | Count | Frequency (%) |
| burchellii | 494 | 7.8% |
| antarctica | 104 | 1.7% |
| inflata | 67 | 1.1% |
| vancouveriensis | 64 | 1.0% |
| mexicana | 54 | 0.9% |
| ornata | 50 | 0.8% |
| caurina | 42 | 0.7% |
| erectus | 39 | 0.6% |
| texana | 33 | 0.5% |
| curta | 32 | 0.5% |
| Other values (1459) | 5322 | 84.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8251 | |
| i | 6485 | |
| s | 4930 | |
| e | 4526 | 8.0% |
| n | 4069 | 7.2% |
| t | 3680 | 6.5% |
| r | 3677 | 6.5% |
| l | 3463 | 6.1% |
| c | 3093 | 5.4% |
| u | 2924 | 5.1% |
| Other values (16) | 11753 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 56851 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8251 | |
| i | 6485 | |
| s | 4930 | |
| e | 4526 | 8.0% |
| n | 4069 | 7.2% |
| t | 3680 | 6.5% |
| r | 3677 | 6.5% |
| l | 3463 | 6.1% |
| c | 3093 | 5.4% |
| u | 2924 | 5.1% |
| Other values (16) | 11753 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 56851 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8251 | |
| i | 6485 | |
| s | 4930 | |
| e | 4526 | 8.0% |
| n | 4069 | 7.2% |
| t | 3680 | 6.5% |
| r | 3677 | 6.5% |
| l | 3463 | 6.1% |
| c | 3093 | 5.4% |
| u | 2924 | 5.1% |
| Other values (16) | 11753 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56851 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8251 | |
| i | 6485 | |
| s | 4930 | |
| e | 4526 | 8.0% |
| n | 4069 | 7.2% |
| t | 3680 | 6.5% |
| r | 3677 | 6.5% |
| l | 3463 | 6.1% |
| c | 3093 | 5.4% |
| u | 2924 | 5.1% |
| Other values (16) | 11753 |
taxonRank
Text
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 6.306741955 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | KINGDOM |
|---|---|
| 2nd row | SPECIES |
| 3rd row | SPECIES |
| 4th row | SPECIES |
| 5th row | SPECIES |
| Value | Count | Frequency (%) |
| species | 268489 | 37.1% |
| genus | 204821 | 28.3% |
| kingdom | 184360 | 25.4% |
| class | 34827 | 4.8% |
| family | 11500 | 1.6% |
| order | 7792 | 1.1% |
| phylum | 6418 | 0.9% |
| subspecies | 3525 | 0.5% |
| variety | 2760 | 0.4% |
| form | 16 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 822028 | |
| E | 759401 | |
| I | 470634 | |
| G | 389181 | |
| N | 389181 | |
| C | 306841 | 6.7% |
| P | 278432 | 6.1% |
| U | 214764 | 4.7% |
| M | 202294 | 4.4% |
| O | 192168 | 4.2% |
| Other values (11) | 544361 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4569285 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 822028 | |
| E | 759401 | |
| I | 470634 | |
| G | 389181 | |
| N | 389181 | |
| C | 306841 | 6.7% |
| P | 278432 | 6.1% |
| U | 214764 | 4.7% |
| M | 202294 | 4.4% |
| O | 192168 | 4.2% |
| Other values (11) | 544361 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4569285 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 822028 | |
| E | 759401 | |
| I | 470634 | |
| G | 389181 | |
| N | 389181 | |
| C | 306841 | 6.7% |
| P | 278432 | 6.1% |
| U | 214764 | 4.7% |
| M | 202294 | 4.4% |
| O | 192168 | 4.2% |
| Other values (11) | 544361 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4569285 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 822028 | |
| E | 759401 | |
| I | 470634 | |
| G | 389181 | |
| N | 389181 | |
| C | 306841 | 6.7% |
| P | 278432 | 6.1% |
| U | 214764 | 4.7% |
| M | 202294 | 4.4% |
| O | 192168 | 4.2% |
| Other values (11) | 544361 |
taxonomicStatus
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 171789 |
| Missing (%) | 23.7% |
| Memory size | 5.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.856598018 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ACCEPTED |
|---|---|
| 2nd row | ACCEPTED |
| 3rd row | SYNONYM |
| 4th row | SYNONYM |
| 5th row | ACCEPTED |
| Value | Count | Frequency (%) |
| accepted | 431194 | 78.0% |
| synonym | 79261 | 14.3% |
| doubtful | 42264 | 7.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 862388 | |
| E | 862388 | |
| T | 473458 | |
| D | 473458 | |
| A | 431194 | |
| P | 431194 | |
| Y | 158522 | 3.7% |
| N | 158522 | 3.7% |
| O | 121525 | 2.8% |
| U | 84528 | 1.9% |
| Other values (5) | 285314 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4342491 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 862388 | |
| E | 862388 | |
| T | 473458 | |
| D | 473458 | |
| A | 431194 | |
| P | 431194 | |
| Y | 158522 | 3.7% |
| N | 158522 | 3.7% |
| O | 121525 | 2.8% |
| U | 84528 | 1.9% |
| Other values (5) | 285314 | 6.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4342491 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 862388 | |
| E | 862388 | |
| T | 473458 | |
| D | 473458 | |
| A | 431194 | |
| P | 431194 | |
| Y | 158522 | 3.7% |
| N | 158522 | 3.7% |
| O | 121525 | 2.8% |
| U | 84528 | 1.9% |
| Other values (5) | 285314 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4342491 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 862388 | |
| E | 862388 | |
| T | 473458 | |
| D | 473458 | |
| A | 431194 | |
| P | 431194 | |
| Y | 158522 | 3.7% |
| N | 158522 | 3.7% |
| O | 121525 | 2.8% |
| U | 84528 | 1.9% |
| Other values (5) | 285314 | 6.6% |
datasetKey
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | c8681cc2-9d0a-4c5f-b620-5c753abfe2bc |
|---|---|
| 2nd row | c8681cc2-9d0a-4c5f-b620-5c753abfe2bc |
| 3rd row | c8681cc2-9d0a-4c5f-b620-5c753abfe2bc |
| 4th row | c8681cc2-9d0a-4c5f-b620-5c753abfe2bc |
| 5th row | c8681cc2-9d0a-4c5f-b620-5c753abfe2bc |
| Value | Count | Frequency (%) |
| c8681cc2-9d0a-4c5f-b620-5c753abfe2bc | 724508 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 4347048 | |
| - | 2898032 | |
| b | 2173524 | |
| 2 | 2173524 | |
| 5 | 2173524 | |
| 8 | 1449016 | 5.6% |
| f | 1449016 | 5.6% |
| a | 1449016 | 5.6% |
| 0 | 1449016 | 5.6% |
| 6 | 1449016 | 5.6% |
| Other values (7) | 5071556 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12316636 | |
| Lowercase Letter | 10867620 | |
| Dash Punctuation | 2898032 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2173524 | |
| 5 | 2173524 | |
| 8 | 1449016 | |
| 0 | 1449016 | |
| 6 | 1449016 | |
| 4 | 724508 | 5.9% |
| 9 | 724508 | 5.9% |
| 1 | 724508 | 5.9% |
| 7 | 724508 | 5.9% |
| 3 | 724508 | 5.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 4347048 | |
| b | 2173524 | |
| f | 1449016 | 13.3% |
| a | 1449016 | 13.3% |
| d | 724508 | 6.7% |
| e | 724508 | 6.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2898032 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15214668 | |
| Latin | 10867620 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 2898032 | |
| 2 | 2173524 | |
| 5 | 2173524 | |
| 8 | 1449016 | |
| 0 | 1449016 | |
| 6 | 1449016 | |
| 4 | 724508 | 4.8% |
| 9 | 724508 | 4.8% |
| 1 | 724508 | 4.8% |
| 7 | 724508 | 4.8% |
Latin
| Value | Count | Frequency (%) |
| c | 4347048 | |
| b | 2173524 | |
| f | 1449016 | 13.3% |
| a | 1449016 | 13.3% |
| d | 724508 | 6.7% |
| e | 724508 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26082288 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 4347048 | |
| - | 2898032 | |
| b | 2173524 | |
| 2 | 2173524 | |
| 5 | 2173524 | |
| 8 | 1449016 | 5.6% |
| f | 1449016 | 5.6% |
| a | 1449016 | 5.6% |
| 0 | 1449016 | 5.6% |
| 6 | 1449016 | 5.6% |
| Other values (7) | 5071556 |
publishingCountry
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 724508 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 724508 | |
| S | 724508 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1449016 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 724508 | |
| S | 724508 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1449016 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 724508 | |
| S | 724508 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1449016 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 724508 | |
| S | 724508 |
lastInterpreted
Text
| Distinct | 37858 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99520778 |
| Min length | 20 |
Unique
| Unique | 984 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 2024-12-02T10:16:26.190Z |
|---|---|
| 2nd row | 2024-12-02T10:16:26.321Z |
| 3rd row | 2024-12-02T10:16:26.322Z |
| 4th row | 2024-12-02T10:16:26.322Z |
| 5th row | 2024-12-02T10:16:26.323Z |
| Value | Count | Frequency (%) |
| 2024-12-02t10:17:03.880z | 100 | < 0.1% |
| 2024-12-02t10:17:08.512z | 92 | < 0.1% |
| 2024-12-02t10:17:04.870z | 87 | < 0.1% |
| 2024-12-02t10:17:05.654z | 87 | < 0.1% |
| 2024-12-02t10:16:52.136z | 85 | < 0.1% |
| 2024-12-02t10:16:59.768z | 85 | < 0.1% |
| 2024-12-02t10:17:07.114z | 85 | < 0.1% |
| 2024-12-02t10:16:58.778z | 84 | < 0.1% |
| 2024-12-02t10:17:03.172z | 84 | < 0.1% |
| 2024-12-02t10:17:08.976z | 83 | < 0.1% |
| Other values (37848) | 723636 | 99.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 3187397 | |
| 0 | 2663851 | |
| 1 | 2462647 | |
| - | 1449016 | |
| : | 1449016 | |
| 4 | 1185381 | 6.8% |
| 6 | 810755 | 4.7% |
| T | 724508 | 4.2% |
| Z | 724508 | 4.2% |
| . | 723640 | 4.2% |
| Other values (5) | 2004001 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12314032 | |
| Other Punctuation | 2172656 | 12.5% |
| Dash Punctuation | 1449016 | 8.3% |
| Uppercase Letter | 1449016 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3187397 | |
| 0 | 2663851 | |
| 1 | 2462647 | |
| 4 | 1185381 | 9.6% |
| 6 | 810755 | 6.6% |
| 7 | 497480 | 4.0% |
| 5 | 488259 | 4.0% |
| 3 | 420464 | 3.4% |
| 9 | 301334 | 2.4% |
| 8 | 296464 | 2.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1449016 | |
| . | 723640 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 724508 | |
| Z | 724508 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1449016 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15935704 | |
| Latin | 1449016 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 3187397 | |
| 0 | 2663851 | |
| 1 | 2462647 | |
| - | 1449016 | |
| : | 1449016 | |
| 4 | 1185381 | 7.4% |
| 6 | 810755 | 5.1% |
| . | 723640 | 4.5% |
| 7 | 497480 | 3.1% |
| 5 | 488259 | 3.1% |
| Other values (3) | 1018262 | 6.4% |
Latin
| Value | Count | Frequency (%) |
| T | 724508 | |
| Z | 724508 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17384720 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 3187397 | |
| 0 | 2663851 | |
| 1 | 2462647 | |
| - | 1449016 | |
| : | 1449016 | |
| 4 | 1185381 | 6.8% |
| 6 | 810755 | 4.7% |
| T | 724508 | 4.2% |
| Z | 724508 | 4.2% |
| . | 723640 | 4.2% |
| Other values (5) | 2004001 |
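The length statistics for lastInterpreted (minimum 20, maximum 24) and the count of "." characters (723640 against 724508 rows) suggest that a small share of values omit the millisecond part. The following is a minimal sketch, using only the Python standard library, of parsing both shapes. The function name `parse_gbif_timestamp` is hypothetical, and the 20-character value is an illustrative assumption rather than a row taken from the report.

```python
from datetime import datetime

def parse_gbif_timestamp(value: str) -> datetime:
    """Parse an ISO 8601 UTC timestamp with or without milliseconds."""
    # datetime.fromisoformat() accepts a trailing "Z" only from Python 3.11,
    # so rewrite it as an explicit UTC offset first.
    if value.endswith("Z"):
        value = value[:-1] + "+00:00"
    return datetime.fromisoformat(value)

# First sample row from the table above (24 characters).
with_millis = parse_gbif_timestamp("2024-12-02T10:16:26.190Z")
# Assumed 20-character form without the fractional part.
without_millis = parse_gbif_timestamp("2024-12-02T10:16:26Z")
```

Both results are timezone-aware, so they can be compared or sorted together without further normalization.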
distanceFromCentroidInMeters
Real number (ℝ)
Missing 
| Distinct | 149 |
|---|---|
| Distinct (%) | 23.1% |
| Missing | 723864 |
| Missing (%) | 99.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2256.841767 |
| Minimum | 0 |
|---|---|
| Maximum | 4992.37105 |
| Zeros | 6 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 857.2535536 |
| Q1 | 857.2535536 |
| median | 2818.630536 |
| Q3 | 2818.630536 |
| 95-th percentile | 4618.527309 |
| Maximum | 4992.37105 |
| Range | 4992.37105 |
| Interquartile range (IQR) | 1961.376982 |
Descriptive statistics
| Standard deviation | 1312.822223 |
|---|---|
| Coefficient of variation (CV) | 0.5817076951 |
| Kurtosis | -1.028590401 |
| Mean | 2256.841767 |
| Median Absolute Deviation (MAD) | 1402.236206 |
| Skewness | 0.30591539 |
| Sum | 1453406.098 |
| Variance | 1723502.188 |
| Monotonicity | Not monotonic |
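Several of the distanceFromCentroidInMeters figures above are derived from others in the same tables. As a worked check, the sketch below recomputes them from the reported values; the variable names are illustrative, and the assumption that Distinct (%) is taken over non-missing rows is inferred from the numbers rather than stated in the report.

```python
# Figures copied from the distanceFromCentroidInMeters tables above.
mean = 2256.841767
std_dev = 1312.822223
q1 = 857.2535536
q3 = 2818.630536
minimum, maximum = 0.0, 4992.37105
total_rows, missing, distinct = 724508, 723864, 149

coefficient_of_variation = std_dev / mean  # reported: 0.5817076951
variance = std_dev ** 2                    # reported: 1723502.188
iqr = q3 - q1                              # reported: 1961.376982
value_range = maximum - minimum            # reported: 4992.37105

# Assumption: Distinct (%) uses the 644 non-missing rows as denominator.
non_missing = total_rows - missing
distinct_pct = 100 * distinct / non_missing  # reported: 23.1%
```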
| Value | Count | Frequency (%) |
| 857.2535536 | 226 | < 0.1% |
| 2818.630536 | 202 | < 0.1% |
| 4618.527309 | 27 | < 0.1% |
| 3800.284004 | 12 | < 0.1% |
| 1824.519626 | 10 | < 0.1% |
| 0 | 6 | < 0.1% |
| 1543.140798 | 6 | < 0.1% |
| 4852.601363 | 5 | < 0.1% |
| 3114.471841 | 4 | < 0.1% |
| 3029.93085 | 3 | < 0.1% |
| Other values (139) | 143 | < 0.1% |
| (Missing) | 723864 | 99.9% |
| Value | Count | Frequency (%) |
| 0 | 6 | < 0.1% |
| 253.452652 | 1 | < 0.1% |
| 533.2556305 | 1 | < 0.1% |
| 599.6747027 | 1 | < 0.1% |
| 605.9334686 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 4992.37105 | 1 | < 0.1% |
| 4985.80659 | 1 | < 0.1% |
| 4984.258263 | 1 | < 0.1% |
| 4978.129443 | 1 | < 0.1% |
| 4968.052222 | 1 | < 0.1% |
issue
Text
| Distinct | 154 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 193 |
| Missing (%) | < 0.1% |
| Memory size | 5.5 MiB |
Length
| Max length | 186 |
|---|---|
| Median length | 181 |
| Mean length | 68.38031105 |
| Min length | 17 |
Unique
| Unique | 22 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84;TAXON_MATCH_NONE |
|---|---|
| 2nd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| 3rd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| 4th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;CONTINENT_DERIVED_FROM_COUNTRY |
| 5th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| Value | Count | Frequency (%) |
| occurrence_status_inferred_from_individual_count | 288609 | 39.8% |
| occurrence_status_inferred_from_individual_count;taxon_match_higherrank | 165166 | 22.8% |
| occurrence_status_inferred_from_individual_count;taxon_match_none | 89011 | 12.3% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;taxon_match_none | 34505 | 4.8% |
| occurrence_status_inferred_from_individual_count;continent_derived_from_country | 25422 | 3.5% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_coordinate_mismatch;taxon_match_none | 15005 | 2.1% |
| occurrence_status_inferred_from_individual_count;recorded_date_mismatch | 12501 | 1.7% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;geodetic_datum_invalid;taxon_match_none | 11612 | 1.6% |
| occurrence_status_inferred_from_individual_count;taxon_match_fuzzy | 10754 | 1.5% |
| occurrence_status_inferred_from_individual_count;continent_derived_from_country;taxon_match_higherrank | 10043 | 1.4% |
| Other values (144) | 61687 | 8.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 5044685 | |
| R | 4292364 | 8.7% |
| N | 4233244 | 8.5% |
| E | 3964433 | 8.0% |
| C | 3659582 | 7.4% |
| I | 3552475 | 7.2% |
| T | 3530938 | 7.1% |
| U | 3207403 | 6.5% |
| O | 3183427 | 6.4% |
| D | 2832439 | 5.7% |
| Other values (18) | 12027895 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 43643926 | |
| Connector Punctuation | 5044685 | 10.2% |
| Other Punctuation | 633292 | 1.3% |
| Decimal Number | 206982 | 0.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 4292364 | |
| N | 4233244 | |
| E | 3964433 | |
| C | 3659582 | |
| I | 3552475 | |
| T | 3530938 | |
| U | 3207403 | 7.3% |
| O | 3183427 | 7.3% |
| D | 2832439 | 6.5% |
| A | 2762222 | 6.3% |
| Other values (14) | 8425399 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 103491 | |
| 4 | 103491 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 5044685 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 633292 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 43643926 | |
| Common | 5884959 | 11.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 4292364 | |
| N | 4233244 | |
| E | 3964433 | |
| C | 3659582 | |
| I | 3552475 | |
| T | 3530938 | |
| U | 3207403 | 7.3% |
| O | 3183427 | 7.3% |
| D | 2832439 | 6.5% |
| A | 2762222 | 6.3% |
| Other values (14) | 8425399 |
Common
| Value | Count | Frequency (%) |
| _ | 5044685 | |
| ; | 633292 | 10.8% |
| 8 | 103491 | 1.8% |
| 4 | 103491 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49528885 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| _ | 5044685 | |
| R | 4292364 | 8.7% |
| N | 4233244 | 8.5% |
| E | 3964433 | 8.0% |
| C | 3659582 | 7.4% |
| I | 3552475 | 7.2% |
| T | 3530938 | 7.1% |
| U | 3207403 | 6.5% |
| O | 3183427 | 6.4% |
| D | 2832439 | 5.7% |
| Other values (18) | 12027895 |
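The issue values above are semicolon-delimited lists of flags, so the 154 distinct strings are combinations rather than 154 separate flags. A minimal sketch of counting individual flags follows; `count_issue_flags` is a hypothetical helper, and the input is the five sample rows shown for this variable.

```python
from collections import Counter

def count_issue_flags(values):
    """Count each flag across semicolon-delimited issue strings."""
    counts = Counter()
    for value in values:
        if not value:  # skip missing entries (None or empty string)
            continue
        counts.update(flag for flag in value.split(";") if flag)
    return counts

# The five sample rows listed for the issue variable.
sample = [
    "OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84;TAXON_MATCH_NONE",
    "OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT",
    "OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT",
    "OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;CONTINENT_DERIVED_FROM_COUNTRY",
    "OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT",
]
flag_counts = count_issue_flags(sample)
```

Counting per flag rather than per combination makes it easier to see how often a single condition, such as a failed taxon match, occurs across the dataset.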
mediaType
Text
Missing 
| Distinct | 58 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 637882 |
| Missing (%) | 88.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 1110 |
|---|---|
| Median length | 1099 |
| Mean length | 20.60165539 |
| Min length | 10 |
Unique
| Unique | 21 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | StillImage |
|---|---|
| 2nd row | StillImage |
| 3rd row | StillImage |
| 4th row | StillImage |
| 5th row | StillImage;StillImage |
| Value | Count | Frequency (%) |
| stillimage | 36835 | 42.5% |
| stillimage;stillimage | 35396 | 40.9% |
| stillimage;stillimage;stillimage;stillimage | 5461 | 6.3% |
| stillimage;stillimage;stillimage | 5225 | 6.0% |
| stillimage;stillimage;stillimage;stillimage;stillimage | 2625 | 3.0% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 354 | 0.4% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 145 | 0.2% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 132 | 0.2% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 79 | 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 74 | 0.1% |
| Other values (48) | 300 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 340230 | |
| S | 170115 | |
| t | 170115 | |
| i | 170115 | |
| I | 170115 | |
| m | 170115 | |
| a | 170115 | |
| g | 170115 | |
| e | 170115 | |
| ; | 83489 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1360920 | |
| Uppercase Letter | 340230 | 19.1% |
| Other Punctuation | 83489 | 4.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 340230 | |
| t | 170115 | |
| i | 170115 | |
| m | 170115 | |
| a | 170115 | |
| g | 170115 | |
| e | 170115 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 170115 | |
| I | 170115 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 83489 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1701150 | |
| Common | 83489 | 4.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 340230 | |
| S | 170115 | |
| t | 170115 | |
| i | 170115 | |
| I | 170115 | |
| m | 170115 | |
| a | 170115 | |
| g | 170115 | |
| e | 170115 |
Common
| Value | Count | Frequency (%) |
| ; | 83489 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1784639 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 340230 | |
| S | 170115 | |
| t | 170115 | |
| i | 170115 | |
| I | 170115 | |
| m | 170115 | |
| a | 170115 | |
| g | 170115 | |
| e | 170115 | |
| ; | 83489 | 4.7% |
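Each mediaType value repeats `StillImage` once per attached media item, separated by semicolons, so the number of items per record can be read from the separator count. The sketch below illustrates this; `media_item_count` is a hypothetical helper, and the final lines only cross-check the character counts reported above against the number of non-missing rows.

```python
def media_item_count(value):
    """Return the number of media items in a semicolon-delimited value."""
    if not value:  # missing or empty means no media attached
        return 0
    return value.count(";") + 1

single = media_item_count("StillImage")
double = media_item_count("StillImage;StillImage")

# Cross-check using figures from the tables above: 170115 "S" characters
# (one per StillImage token) and 83489 ";" separators.
records_with_media = 170115 - 83489
non_missing_rows = 724508 - 637882
```

The two derived numbers agree, which is consistent with every non-missing value consisting only of `StillImage` tokens.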
hasCoordinate
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 707.7 KiB |
| Value | Count | Frequency (%) |
| False | 620570 | 85.7% |
| True | 103938 | 14.3% |
hasGeospatialIssues
Boolean
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 707.7 KiB |
| Value | Count | Frequency (%) |
| False | 723170 | 99.8% |
| True | 1338 | 0.2% |
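The report's alert list flags hasGeospatialIssues as highly imbalanced (98.1%). One definition that reproduces this figure from the counts above is 1 minus the Shannon entropy normalized by the maximum entropy for the number of classes. Whether the profiler computes it exactly this way is an assumption; `imbalance_score` is an illustrative name.

```python
import math

def imbalance_score(counts):
    """1 - normalized Shannon entropy: 0 is balanced, 1 is a single class."""
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    total = sum(counts)
    # Empty classes contribute nothing to the entropy.
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1.0 - entropy / math.log2(n_classes)

# Counts of False and True from the table above.
score = imbalance_score([723170, 1338])
```

With these counts the score is about 0.981, while a 50/50 split would give 0.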
taxonKey
Real number (ℝ)
Zeros 
| Distinct | 65365 |
|---|---|
| Distinct (%) | 9.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4156268.397 |
| Minimum | 0 |
|---|---|
| Maximum | 12387090 |
| Zeros | 171789 |
| Zeros (%) | 23.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 6 |
| median | 4806932 |
| Q3 | 7794651 |
| 95-th percentile | 9255998 |
| Maximum | 12387090 |
| Range | 12387090 |
| Interquartile range (IQR) | 7794645 |
Descriptive statistics
| Standard deviation | 3563951.427 |
|---|---|
| Coefficient of variation (CV) | 0.857488277 |
| Kurtosis | -1.280507827 |
| Mean | 4156268.397 |
| Median Absolute Deviation (MAD) | 3776216.5 |
| Skewness | 0.1911040766 |
| Sum | 3.011249704 × 10¹² |
| Variance | 1.270174977 × 10¹³ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 171789 | 23.7% |
| 216 | 16872 | 2.3% |
| 8513230 | 13693 | 1.9% |
| 4806028 | 12281 | 1.7% |
| 6 | 11457 | 1.6% |
| 359 | 4656 | 0.6% |
| 44 | 4268 | 0.6% |
| 729 | 3674 | 0.5% |
| 353 | 3566 | 0.5% |
| 2481460 | 3232 | 0.4% |
| Other values (65355) | 479020 | 66.1% |
| Value | Count | Frequency (%) |
| 0 | 171789 | 23.7% |
| 1 | 1114 | 0.2% |
| 6 | 11457 | 1.6% |
| 42 | 952 | 0.1% |
| 43 | 51 | < 0.1% |
| Value | Count | Frequency (%) |
| 12387090 | 1 | < 0.1% |
| 12385426 | 4 | < 0.1% |
| 12385220 | 2 | < 0.1% |
| 12383973 | 1 | < 0.1% |
| 12379591 | 6 | < 0.1% |
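taxonKey contains 171789 zeros, the same count as the missing values reported for taxonomicStatus and acceptedTaxonKey. This suggests, as an assumption not confirmed by the report, that 0 acts as a placeholder for records without a taxon match. Under that assumption, a minimal sketch of converting the placeholder to a missing value before computing statistics could look like this; `zero_to_missing` is a hypothetical helper.

```python
def zero_to_missing(keys):
    """Replace the assumed placeholder 0 with None."""
    return [None if key == 0 else key for key in keys]

# Values taken from the taxonKey frequency table; the order is illustrative.
sample_keys = [0, 216, 8513230, 0, 6]
cleaned = zero_to_missing(sample_keys)
matched = [key for key in cleaned if key is not None]
```

Treating the zeros as missing keeps them from pulling the mean and the lower quantiles toward 0, as seen in the 5-th percentile and Q1 rows above.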
acceptedTaxonKey
Real number (ℝ)
Missing 
| Distinct | 58335 |
|---|---|
| Distinct (%) | 10.6% |
| Missing | 171789 |
| Missing (%) | 23.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5515085.25 |
| Minimum | 1 |
|---|---|
| Maximum | 12385426 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 216 |
| Q1 | 3249393 |
| median | 4941659 |
| Q3 | 8513230 |
| 95-th percentile | 9626241 |
| Maximum | 12385426 |
| Range | 12385425 |
| Interquartile range (IQR) | 5263837 |
Descriptive statistics
| Standard deviation | 3184869.125 |
|---|---|
| Coefficient of variation (CV) | 0.5774832084 |
| Kurtosis | -0.8885403732 |
| Mean | 5515085.25 |
| Median Absolute Deviation (MAD) | 2688948 |
| Skewness | -0.1769613565 |
| Sum | 3.048292404 × 10¹² |
| Variance | 1.014339134 × 10¹³ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 216 | 16872 | 2.3% |
| 8513230 | 13693 | 1.9% |
| 4806028 | 12281 | 1.7% |
| 6 | 11457 | 1.6% |
| 359 | 4656 | 0.6% |
| 44 | 4268 | 0.6% |
| 729 | 3674 | 0.5% |
| 353 | 3566 | 0.5% |
| 2481460 | 3232 | 0.4% |
| 4832444 | 3022 | 0.4% |
| Other values (58325) | 475998 | 65.7% |
| (Missing) | 171789 | 23.7% |
| Value | Count | Frequency (%) |
| 1 | 1114 | 0.2% |
| 6 | 11457 | 1.6% |
| 42 | 952 | 0.1% |
| 43 | 51 | < 0.1% |
| 44 | 4268 | 0.6% |
| Value | Count | Frequency (%) |
| 12385426 | 4 | < 0.1% |
| 12385220 | 2 | < 0.1% |
| 12379591 | 6 | < 0.1% |
| 12362277 | 15 | < 0.1% |
| 12358726 | 5 | < 0.1% |
kingdomKey
Real number (ℝ)
Zeros 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.303661243 |
| Minimum | 0 |
|---|---|
| Maximum | 7 |
| Zeros | 171929 |
| Zeros (%) | 23.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 6 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.508442348 |
|---|---|
| Coefficient of variation (CV) | 1.157081531 |
| Kurtosis | 2.902387583 |
| Mean | 1.303661243 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.923133372 |
| Sum | 944513 |
| Variance | 2.275398317 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 446288 | 61.6% |
| 0 | 171929 | 23.7% |
| 4 | 69124 | 9.5% |
| 6 | 36324 | 5.0% |
| 3 | 502 | 0.1% |
| 7 | 287 | < 0.1% |
| 5 | 54 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 171929 | 23.7% |
| 1 | 446288 | 61.6% |
| 3 | 502 | 0.1% |
| 4 | 69124 | 9.5% |
| 5 | 54 | < 0.1% |
| Value | Count | Frequency (%) |
| 7 | 287 | < 0.1% |
| 6 | 36324 | 5.0% |
| 5 | 54 | < 0.1% |
| 4 | 69124 | 9.5% |
| 3 | 502 | 0.1% |
phylumKey
Real number (ℝ)
Missing 
| Distinct | 40 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 192842 |
| Missing (%) | 26.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1373510.734 |
| Minimum | 9 |
|---|---|
| Maximum | 12228025 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 9 |
|---|---|
| 5-th percentile | 44 |
| Q1 | 44 |
| median | 53 |
| Q3 | 110 |
| 95-th percentile | 8376456 |
| Maximum | 12228025 |
| Range | 12228016 |
| Interquartile range (IQR) | 66 |
Descriptive statistics
| Standard deviation | 3060656.412 |
|---|---|
| Coefficient of variation (CV) | 2.228345463 |
| Kurtosis | 1.208523107 |
| Mean | 1373510.734 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 1.786678435 |
| Sum | 7.302489577 × 10¹¹ |
| Variance | 9.367617675 × 10¹² |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 44 | 148527 | 20.5% |
| 54 | 101505 | 14.0% |
| 52 | 66708 | 9.2% |
| 8376456 | 66099 | 9.1% |
| 110 | 65633 | 9.1% |
| 50 | 27100 | 3.7% |
| 7707728 | 21340 | 2.9% |
| 53 | 13677 | 1.9% |
| 43 | 6914 | 1.0% |
| 42 | 3027 | 0.4% |
| Other values (30) | 11136 | 1.5% |
| (Missing) | 192842 | 26.6% |
| Value | Count | Frequency (%) |
| 9 | 20 | < 0.1% |
| 14 | 46 | < 0.1% |
| 32 | 1 | < 0.1% |
| 33 | 225 | < 0.1% |
| 35 | 20 | < 0.1% |
| Value | Count | Frequency (%) |
| 12228025 | 12 | < 0.1% |
| 9778081 | 1 | < 0.1% |
| 8770992 | 11 | < 0.1% |
| 8376456 | 66099 | 9.1% |
| 8173593 | 15 | < 0.1% |
classKey
Real number (ℝ)
Missing 
| Distinct | 92 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 272566 |
| Missing (%) | 37.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1466432.75 |
| Minimum | 116 |
|---|---|
| Maximum | 12259753 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 116 |
|---|---|
| 5-th percentile | 121 |
| Q1 | 210 |
| median | 220 |
| Q3 | 359 |
| 95-th percentile | 9273948 |
| Maximum | 12259753 |
| Range | 12259637 |
| Interquartile range (IQR) | 149 |
Descriptive statistics
| Standard deviation | 3184973.092 |
|---|---|
| Coefficient of variation (CV) | 2.17191896 |
| Kurtosis | 1.344083486 |
| Mean | 1466432.75 |
| Median Absolute Deviation (MAD) | 83 |
| Skewness | 1.775836497 |
| Sum | 6.627425498 × 10¹¹ |
| Variance | 1.01440536 × 10¹³ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 359 | 59795 | 8.3% |
| 7434778 | 42882 | 5.9% |
| 210 | 39551 | 5.5% |
| 212 | 34584 | 4.8% |
| 216 | 32733 | 4.5% |
| 225 | 24245 | 3.3% |
| 353 | 23481 | 3.2% |
| 121 | 23303 | 3.2% |
| 9273948 | 22315 | 3.1% |
| 137 | 22257 | 3.1% |
| Other values (82) | 126796 | 17.5% |
| (Missing) | 272566 | 37.6% |
| Value | Count | Frequency (%) |
| 116 | 36 | < 0.1% |
| 120 | 659 | 0.1% |
| 121 | 23303 | 3.2% |
| 125 | 9 | < 0.1% |
| 126 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 12259753 | 1 | < 0.1% |
| 12203163 | 1 | < 0.1% |
| 12186859 | 12 | < 0.1% |
| 11733052 | 62 | < 0.1% |
| 11592253 | 1006 | 0.1% |
orderKey
Real number (ℝ)
Missing 
| Distinct | 484 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 369296 |
| Missing (%) | 51.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3512590.676 |
| Minimum | 370 |
|---|---|
| Maximum | 12263124 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 370 |
|---|---|
| 5-th percentile | 509 |
| Q1 | 798 |
| median | 1436 |
| Q3 | 7692889 |
| 95-th percentile | 11151631 |
| Maximum | 12263124 |
| Range | 12262754 |
| Interquartile range (IQR) | 7692091 |
Descriptive statistics
| Standard deviation | 4380254.677 |
|---|---|
| Coefficient of variation (CV) | 1.247015403 |
| Kurtosis | -1.502677218 |
| Mean | 3512590.676 |
| Median Absolute Deviation (MAD) | 799 |
| Skewness | 0.5410560688 |
| Sum | 1.247714359 × 10¹² |
| Variance | 1.918663103 × 10¹³ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7692889 | 32460 | 4.5% |
| 811 | 14185 | 2.0% |
| 1419 | 14086 | 1.9% |
| 1438 | 12424 | 1.7% |
| 885 | 11376 | 1.6% |
| 733 | 10382 | 1.4% |
| 7192755 | 9895 | 1.4% |
| 731 | 8981 | 1.2% |
| 509 | 8715 | 1.2% |
| 795 | 7870 | 1.1% |
| Other values (474) | 224838 | 31.0% |
| (Missing) | 369296 | 51.0% |
| Value | Count | Frequency (%) |
| 370 | 300 | < 0.1% |
| 371 | 3664 | 0.5% |
| 376 | 8 | < 0.1% |
| 381 | 5 | < 0.1% |
| 392 | 635 | 0.1% |
| Value | Count | Frequency (%) |
| 12263124 | 1 | < 0.1% |
| 12261528 | 2195 | 0.3% |
| 12260364 | 11 | < 0.1% |
| 12244639 | 2 | < 0.1% |
| 12243044 | 2 | < 0.1% |
familyKey
Real number (ℝ)
Missing 
| Distinct | 4832 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 258765 |
| Missing (%) | 35.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3036480.23 |
| Minimum | 1895 |
|---|---|
| Maximum | 12262968 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 1895 |
|---|---|
| 5-th percentile | 2918 |
| Q1 | 7086 |
| median | 3252093 |
| Q3 | 4834682 |
| 95-th percentile | 8052057 |
| Maximum | 12262968 |
| Range | 12261073 |
| Interquartile range (IQR) | 4827596 |
Descriptive statistics
| Standard deviation | 2821037.453 |
|---|---|
| Coefficient of variation (CV) | 0.9290485166 |
| Kurtosis | -0.5522487965 |
| Mean | 3036480.23 |
| Median Absolute Deviation (MAD) | 3242579 |
| Skewness | 0.508338039 |
| Sum | 1.414219412 × 10¹² |
| Variance | 7.958252313 × 10¹² |
| Monotonicity | Not monotonic |
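Several figures in the familyKey tables above can be re-derived from the others. The following is a minimal sketch, assuming the usual definitions (range = maximum − minimum, IQR = Q3 − Q1, CV = standard deviation / mean, variance = standard deviation squared); the variable names are illustrative and not part of the report.

```python
import math

# Values copied from the familyKey tables above.
minimum, maximum = 1895, 12262968
q1, q3 = 7086, 4834682
mean, std = 3036480.23, 2821037.453

value_range = maximum - minimum   # reported Range: 12261073
iqr = q3 - q1                     # reported IQR: 4827596
cv = std / mean                   # reported CV: 0.9290485166
variance = std ** 2               # reported Variance: 7.958252313 × 10^12

print(value_range, iqr)
print(math.isclose(cv, 0.9290485166, rel_tol=1e-6))
print(math.isclose(variance, 7.958252313e12, rel_tol=1e-6))
```

Because familyKey holds taxon identifiers rather than measurements, these statistics are mainly useful as a consistency check on the report, not as a description of the specimens.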
| Value | Count | Frequency (%) |
| 3255384 | 14086 | 1.9% |
| 9496 | 13693 | 1.9% |
| 9339 | 9409 | 1.3% |
| 5888 | 7013 | 1.0% |
| 2211 | 5646 | 0.8% |
| 2986 | 5251 | 0.7% |
| 5310 | 4763 | 0.7% |
| 7923659 | 3864 | 0.5% |
| 5479 | 3840 | 0.5% |
| 5446 | 3794 | 0.5% |
| Other values (4822) | 394384 | |
| (Missing) | 258765 |
| Value | Count | Frequency (%) |
| 1895 | 12 | |
| 1897 | 3 | < 0.1% |
| 1978 | 20 | |
| 1989 | 12 | |
| 2006 | 29 |
| Value | Count | Frequency (%) |
| 12262968 | 4 | < 0.1% |
| 12247189 | 9 | < 0.1% |
| 12246268 | 3 | < 0.1% |
| 12236981 | 32 | |
| 12234980 | 3 | < 0.1% |
genusKey
Real number (ℝ)
Missing 
| Distinct | 20311 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 245070 |
| Missing (%) | 33.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4935876.249 |
| Minimum | 1000424 |
|---|---|
| Maximum | 12385426 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 1000424 |
|---|---|
| 5-th percentile | 2278992 |
| Q1 | 3251308 |
| median | 4830391 |
| Q3 | 4897257 |
| 95-th percentile | 8513230 |
| Maximum | 12385426 |
| Range | 11385002 |
| Interquartile range (IQR) | 1645949 |
Descriptive statistics
| Standard deviation | 2083699.037 |
|---|---|
| Coefficient of variation (CV) | 0.4221538248 |
| Kurtosis | 0.07746136144 |
| Mean | 4935876.249 |
| Median Absolute Deviation (MAD) | 598535 |
| Skewness | 0.610001942 |
| Sum | 2.366446637 × 10^12 |
| Variance | 4.341801678 × 10^12 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8513230 | 13693 | 1.9% |
| 4806028 | 12281 | 1.7% |
| 2481443 | 6789 | 0.9% |
| 4833150 | 3770 | 0.5% |
| 4832444 | 3029 | 0.4% |
| 2417963 | 2974 | 0.4% |
| 4848792 | 2250 | 0.3% |
| 4851051 | 2208 | 0.3% |
| 4870176 | 2080 | 0.3% |
| 2498190 | 2051 | 0.3% |
| Other values (20301) | 428313 | |
| (Missing) | 245070 |
| Value | Count | Frequency (%) |
| 1000424 | 11 | < 0.1% |
| 1003585 | 4 | < 0.1% |
| 1003655 | 29 | |
| 1003657 | 2 | < 0.1% |
| 1003659 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 12385426 | 4 | |
| 12385220 | 2 | < 0.1% |
| 12384711 | 5 | |
| 12379591 | 6 | |
| 12378210 | 2 | < 0.1% |
speciesKey
Real number (ℝ)
Missing 
| Distinct | 45066 |
|---|---|
| Distinct (%) | 16.4% |
| Missing | 450165 |
| Missing (%) | 62.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7340362.403 |
| Minimum | 1003615 |
|---|---|
| Maximum | 12353765 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.5 MiB |
Quantile statistics
| Minimum | 1003615 |
|---|---|
| 5-th percentile | 2441022 |
| Q1 | 4977965 |
| median | 8423909 |
| Q3 | 9037391.5 |
| 95-th percentile | 11127348 |
| Maximum | 12353765 |
| Range | 11350150 |
| Interquartile range (IQR) | 4059426.5 |
Descriptive statistics
| Standard deviation | 2477246.806 |
|---|---|
| Coefficient of variation (CV) | 0.3374829021 |
| Kurtosis | -0.4744667152 |
| Mean | 7340362.403 |
| Median Absolute Deviation (MAD) | 1034777 |
| Skewness | -0.5664310113 |
| Sum | 2.013777043 × 10^12 |
| Variance | 6.136751738 × 10^12 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2481460 | 3232 | 0.4% |
| 2481469 | 1833 | 0.3% |
| 9413495 | 1648 | 0.2% |
| 8819428 | 1401 | 0.2% |
| 4941659 | 1115 | 0.2% |
| 2481465 | 1050 | 0.1% |
| 5816525 | 1044 | 0.1% |
| 5816410 | 917 | 0.1% |
| 4874907 | 816 | 0.1% |
| 12198857 | 814 | 0.1% |
| Other values (45056) | 260473 | |
| (Missing) | 450165 |
| Value | Count | Frequency (%) |
| 1003615 | 2 | |
| 1003627 | 2 | |
| 1003667 | 1 | |
| 1003733 | 1 | |
| 1003829 | 1 |
| Value | Count | Frequency (%) |
| 12353765 | 1 | < 0.1% |
| 12326275 | 2 | |
| 12279081 | 4 | |
| 12266515 | 1 | < 0.1% |
| 12266463 | 2 |
species
Text
Missing 
| Distinct | 45045 |
|---|---|
| Distinct (%) | 16.4% |
| Missing | 450165 |
| Missing (%) | 62.1% |
| Memory size | 5.5 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 36 |
| Mean length | 19.97971153 |
| Min length | 9 |
Unique
| Unique | 16582 |
|---|---|
| Unique (%) | 6.0% |
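The percentages for species can be reproduced from the raw counts. This is a minimal sketch, assuming that Missing (%) is taken over all rows while Distinct (%) and Unique (%) are taken over non-missing rows only; the reported figures are consistent with that reading. The variable names are illustrative.

```python
n_rows = 724508      # "Number of observations" from the report overview
n_missing = 450165   # species: Missing
n_distinct = 45045   # species: Distinct
n_unique = 16582     # species: Unique (values that occur exactly once)

n_present = n_rows - n_missing                  # rows with a species value
missing_pct = 100 * n_missing / n_rows          # share of all rows
distinct_pct = 100 * n_distinct / n_present     # share of non-missing rows
unique_pct = 100 * n_unique / n_present         # share of non-missing rows

print(n_present, round(missing_pct, 1), round(distinct_pct, 1), round(unique_pct, 1))
```

The non-missing count obtained this way, 274343, also matches the number of uppercase letters and spaces reported for this column further down, which is what one would expect if every value is a two-word binomial with a single capital.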
Sample
| 1st row | Damaliscus lunatus |
|---|---|
| 2nd row | Acrochordiceras hyatti |
| 3rd row | Asterocyclina minima |
| 4th row | Carcharias tricuspidatus |
| 5th row | Enteletes rotundobesus |
| Value | Count | Frequency (%) |
| pterodroma | 6569 | 1.2% |
| phaeopygia | 3232 | 0.6% |
| carcharias | 2554 | 0.5% |
| hustedia | 2069 | 0.4% |
| alba | 2031 | 0.4% |
| oxyrhina | 1714 | 0.3% |
| lepidocyclina | 1710 | 0.3% |
| hyopsodus | 1699 | 0.3% |
| megalodon | 1650 | 0.3% |
| bolivina | 1496 | 0.3% |
| Other values (34798) | 523962 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 606832 | 11.1% |
| i | 523413 | 9.5% |
| s | 407795 | 7.4% |
| e | 395509 | 7.2% |
| o | 376936 | 6.9% |
| r | 355606 | 6.5% |
| n | 311565 | 5.7% |
| l | 303075 | 5.5% |
| (space) | 274343 | 5.0% |
| t | 273072 | 5.0% |
| Other values (44) | 1653148 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4932605 | |
| Space Separator | 274343 | 5.0% |
| Uppercase Letter | 274343 | 5.0% |
| Dash Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 606832 | |
| i | 523413 | |
| s | 407795 | 8.3% |
| e | 395509 | 8.0% |
| o | 376936 | 7.6% |
| r | 355606 | 7.2% |
| n | 311565 | 6.3% |
| l | 303075 | 6.1% |
| t | 273072 | 5.5% |
| u | 258946 | 5.2% |
| Other values (16) | 1119856 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 40367 | |
| C | 33494 | |
| A | 21018 | 7.7% |
| S | 18923 | 6.9% |
| M | 18131 | 6.6% |
| T | 14851 | 5.4% |
| H | 14776 | 5.4% |
| L | 14523 | 5.3% |
| E | 13423 | 4.9% |
| B | 13044 | 4.8% |
| Other values (16) | 71793 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 274343 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5206948 | |
| Common | 274346 | 5.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 606832 | |
| i | 523413 | 10.1% |
| s | 407795 | 7.8% |
| e | 395509 | 7.6% |
| o | 376936 | 7.2% |
| r | 355606 | 6.8% |
| n | 311565 | 6.0% |
| l | 303075 | 5.8% |
| t | 273072 | 5.2% |
| u | 258946 | 5.0% |
| Other values (42) | 1394199 |
Common
| Value | Count | Frequency (%) |
| (space) | 274343 | |
| - | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5481294 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 606832 | 11.1% |
| i | 523413 | 9.5% |
| s | 407795 | 7.4% |
| e | 395509 | 7.2% |
| o | 376936 | 6.9% |
| r | 355606 | 6.5% |
| n | 311565 | 5.7% |
| l | 303075 | 5.5% |
| (space) | 274343 | 5.0% |
| t | 273072 | 5.0% |
| Other values (44) | 1653148 |
acceptedScientificName
Text
Missing 
| Distinct | 58335 |
|---|---|
| Distinct (%) | 10.6% |
| Missing | 171789 |
| Missing (%) | 23.7% |
| Memory size | 5.5 MiB |
Length
| Max length | 124 |
|---|---|
| Median length | 80 |
| Mean length | 28.18863111 |
| Min length | 4 |
Unique
| Unique | 20407 |
|---|---|
| Unique (%) | 3.7% |
Sample
| 1st row | Damaliscus lunatus (Burchell, 1823) |
|---|---|
| 2nd row | Acrochordiceras hyatti Meek, 1877 |
| 3rd row | Asterocyclina minima (Cushman, 1918) |
| 4th row | Carcharias tricuspidatus Day, 1878 |
| 5th row | Enteletes rotundobesus Cooper & Grant, 1976 |
| Value | Count | Frequency (%) |
| & | 80122 | 4.1% |
| walcott | 31024 | 1.6% |
| cooper | 23991 | 1.2% |
| insecta | 16885 | 0.9% |
| 1912 | 16538 | 0.8% |
| cushman | 16371 | 0.8% |
| grant | 16172 | 0.8% |
| 1976 | 14710 | 0.7% |
| genus | 13850 | 0.7% |
| js | 13693 | 0.7% |
| Other values (46962) | 1721141 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1411778 | 9.1% |
| a | 1238433 | 7.9% |
| e | 968772 | 6.2% |
| i | 903533 | 5.8% |
| o | 818690 | 5.3% |
| r | 805467 | 5.2% |
| s | 768080 | 4.9% |
| n | 717605 | 4.6% |
| l | 693504 | 4.5% |
| t | 605528 | 3.9% |
| Other values (99) | 6649002 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10325962 | |
| Decimal Number | 1830048 | 11.7% |
| Space Separator | 1411778 | 9.1% |
| Uppercase Letter | 1180045 | 7.6% |
| Other Punctuation | 582213 | 3.7% |
| Open Punctuation | 123706 | 0.8% |
| Close Punctuation | 123706 | 0.8% |
| Dash Punctuation | 2931 | < 0.1% |
| Math Symbol | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1238433 | |
| e | 968772 | |
| i | 903533 | 8.8% |
| o | 818690 | 7.9% |
| r | 805467 | 7.8% |
| s | 768080 | 7.4% |
| n | 717605 | 6.9% |
| l | 693504 | 6.7% |
| t | 605528 | 5.9% |
| u | 452103 | 4.4% |
| Other values (47) | 2354247 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 146305 | |
| P | 101665 | 8.6% |
| S | 99149 | 8.4% |
| B | 93739 | 7.9% |
| M | 80929 | 6.9% |
| G | 77205 | 6.5% |
| L | 66062 | 5.6% |
| W | 66034 | 5.6% |
| A | 64513 | 5.5% |
| H | 57879 | 4.9% |
| Other values (22) | 326565 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 540277 | |
| 9 | 307736 | |
| 8 | 288960 | |
| 7 | 138348 | 7.6% |
| 6 | 114061 | 6.2% |
| 5 | 103286 | 5.6% |
| 2 | 99635 | 5.4% |
| 3 | 90203 | 4.9% |
| 0 | 73911 | 4.0% |
| 4 | 73631 | 4.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 461741 | |
| & | 80122 | 13.8% |
| . | 32324 | 5.6% |
| ' | 8024 | 1.4% |
| ? | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1411778 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 123706 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 123706 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2931 |
Math Symbol
| Value | Count | Frequency (%) |
| × | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11506007 | |
| Common | 4074385 | 26.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1238433 | 10.8% |
| e | 968772 | 8.4% |
| i | 903533 | 7.9% |
| o | 818690 | 7.1% |
| r | 805467 | 7.0% |
| s | 768080 | 6.7% |
| n | 717605 | 6.2% |
| l | 693504 | 6.0% |
| t | 605528 | 5.3% |
| u | 452103 | 3.9% |
| Other values (79) | 3534292 |
Common
| Value | Count | Frequency (%) |
| (space) | 1411778 | |
| 1 | 540277 | 13.3% |
| , | 461741 | 11.3% |
| 9 | 307736 | 7.6% |
| 8 | 288960 | 7.1% |
| 7 | 138348 | 3.4% |
| ( | 123706 | 3.0% |
| ) | 123706 | 3.0% |
| 6 | 114061 | 2.8% |
| 5 | 103286 | 2.5% |
| Other values (10) | 460786 | 11.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15566857 | |
| None | 13535 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1411778 | 9.1% |
| a | 1238433 | 8.0% |
| e | 968772 | 6.2% |
| i | 903533 | 5.8% |
| o | 818690 | 5.3% |
| r | 805467 | 5.2% |
| s | 768080 | 4.9% |
| n | 717605 | 4.6% |
| l | 693504 | 4.5% |
| t | 605528 | 3.9% |
| Other values (61) | 6635467 |
None
| Value | Count | Frequency (%) |
| ü | 3598 | |
| ö | 2698 | |
| é | 2171 | |
| è | 2104 | |
| ú | 1665 | |
| ã | 293 | 2.2% |
| ž | 160 | 1.2% |
| ä | 147 | 1.1% |
| å | 122 | 0.9% |
| ë | 98 | 0.7% |
| Other values (28) | 479 | 3.5% |
verbatimScientificName
Text
Missing 
| Distinct | 97401 |
|---|---|
| Distinct (%) | 17.6% |
| Missing | 171332 |
| Missing (%) | 23.6% |
| Memory size | 5.5 MiB |
Length
| Max length | 62 |
|---|---|
| Median length | 56 |
| Mean length | 18.07695742 |
| Min length | 5 |
Unique
| Unique | 44766 |
|---|---|
| Unique (%) | 8.1% |
Sample
| 1st row | Damaliscus lunatus |
|---|---|
| 2nd row | Acrochordiceras hyatti |
| 3rd row | Discocyclina (Asterocyclina) sculpturata |
| 4th row | Odontaspis cuspidata |
| 5th row | Enteletes rotundobesus |
| Value | Count | Frequency (%) |
| sp | 136960 | 12.1% |
| genus | 56232 | 5.0% |
| insecta | 16851 | 1.5% |
| splendens | 12400 | 1.1% |
| marrella | 12281 | 1.1% |
| pterodroma | 7305 | 0.6% |
| var | 6498 | 0.6% |
| callophoca | 3770 | 0.3% |
| isurus | 3463 | 0.3% |
| ostracoda | 3391 | 0.3% |
| Other values (53913) | 873954 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1021294 | 10.2% |
| s | 909134 | 9.1% |
| i | 819278 | 8.2% |
| e | 762530 | 7.6% |
| o | 610330 | 6.1% |
| r | 609311 | 6.1% |
| n | 592254 | 5.9% |
| (space) | 579929 | 5.8% |
| l | 537519 | 5.4% |
| u | 466436 | 4.7% |
| Other values (62) | 3091724 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8787040 | |
| Space Separator | 579929 | 5.8% |
| Uppercase Letter | 575487 | 5.8% |
| Close Punctuation | 22326 | 0.2% |
| Open Punctuation | 22314 | 0.2% |
| Other Punctuation | 10186 | 0.1% |
| Decimal Number | 1938 | < 0.1% |
| Dash Punctuation | 518 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1021294 | |
| s | 909134 | |
| i | 819278 | |
| e | 762530 | 8.7% |
| o | 610330 | 6.9% |
| r | 609311 | 6.9% |
| n | 592254 | 6.7% |
| l | 537519 | 6.1% |
| u | 466436 | 5.3% |
| t | 465047 | 5.3% |
| Other values (16) | 1993907 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 79813 | |
| P | 69195 | |
| C | 60147 | |
| A | 39927 | 6.9% |
| M | 39806 | 6.9% |
| S | 35677 | 6.2% |
| B | 27831 | 4.8% |
| H | 26616 | 4.6% |
| T | 26590 | 4.6% |
| I | 25413 | 4.4% |
| Other values (16) | 144472 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 962 | |
| 2 | 543 | |
| 3 | 206 | 10.6% |
| 4 | 92 | 4.7% |
| 5 | 67 | 3.5% |
| 6 | 38 | 2.0% |
| 7 | 19 | 1.0% |
| 8 | 5 | 0.3% |
| 0 | 4 | 0.2% |
| 9 | 2 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 10146 | |
| ' | 21 | 0.2% |
| ? | 13 | 0.1% |
| * | 5 | < 0.1% |
| # | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 579929 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 22326 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 22314 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 518 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9362527 | |
| Common | 637212 | 6.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1021294 | 10.9% |
| s | 909134 | 9.7% |
| i | 819278 | 8.8% |
| e | 762530 | 8.1% |
| o | 610330 | 6.5% |
| r | 609311 | 6.5% |
| n | 592254 | 6.3% |
| l | 537519 | 5.7% |
| u | 466436 | 5.0% |
| t | 465047 | 5.0% |
| Other values (42) | 2569394 |
Common
| Value | Count | Frequency (%) |
| (space) | 579929 | |
| ) | 22326 | 3.5% |
| ( | 22314 | 3.5% |
| . | 10146 | 1.6% |
| 1 | 962 | 0.2% |
| 2 | 543 | 0.1% |
| - | 518 | 0.1% |
| 3 | 206 | < 0.1% |
| 4 | 92 | < 0.1% |
| 5 | 67 | < 0.1% |
| Other values (10) | 109 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9999739 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1021294 | 10.2% |
| s | 909134 | 9.1% |
| i | 819278 | 8.2% |
| e | 762530 | 7.6% |
| o | 610330 | 6.1% |
| r | 609311 | 6.1% |
| n | 592254 | 5.9% |
| (space) | 579929 | 5.8% |
| l | 537519 | 5.4% |
| u | 466436 | 4.7% |
| Other values (62) | 3091724 |
typifiedName
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 14.3% |
| Missing | 724501 |
| Missing (%) | > 99.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Type |
|---|---|
| 2nd row | Type |
| 3rd row | Type |
| 4th row | Type |
| 5th row | Type |
| Value | Count | Frequency (%) |
| type | 7 |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 7 | |
| y | 7 | |
| p | 7 | |
| e | 7 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 21 | |
| Uppercase Letter | 7 | 25.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| y | 7 | |
| p | 7 | |
| e | 7 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 28 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 7 | |
| y | 7 | |
| p | 7 | |
| e | 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| T | 7 | |
| y | 7 | |
| p | 7 | |
| e | 7 |
protocol
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EML |
|---|---|
| 2nd row | EML |
| 3rd row | EML |
| 4th row | EML |
| 5th row | EML |
| Value | Count | Frequency (%) |
| eml | 724508 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 724508 | |
| M | 724508 | |
| L | 724508 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2173524 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 724508 | |
| M | 724508 | |
| L | 724508 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2173524 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 724508 | |
| M | 724508 | |
| L | 724508 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2173524 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 724508 | |
| M | 724508 | |
| L | 724508 |
lastParsed
Text
| Distinct | 37858 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99520778 |
| Min length | 20 |
Unique
| Unique | 984 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 2024-12-02T10:16:26.190Z |
|---|---|
| 2nd row | 2024-12-02T10:16:26.321Z |
| 3rd row | 2024-12-02T10:16:26.322Z |
| 4th row | 2024-12-02T10:16:26.322Z |
| 5th row | 2024-12-02T10:16:26.323Z |
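Although lastParsed is profiled as Text, the sample rows are ISO 8601 UTC timestamps. The minimum length of 20 against a maximum of 24 suggests, as an assumption, that a small number of values omit the fractional seconds. The sketch below handles both shapes using only the standard library; `parse_gbif_timestamp` is a hypothetical helper name, and the 20-character input is an invented example of the shorter form.

```python
from datetime import datetime, timezone


def parse_gbif_timestamp(text: str) -> datetime:
    """Parse 'YYYY-MM-DDTHH:MM:SS[.fff]Z' into a timezone-aware UTC datetime."""
    # Try the form with fractional seconds first, then the form without.
    for fmt in ("%Y-%m-%dT%H:%M:%S.%fZ", "%Y-%m-%dT%H:%M:%SZ"):
        try:
            return datetime.strptime(text, fmt).replace(tzinfo=timezone.utc)
        except ValueError:
            continue
    raise ValueError(f"unrecognised timestamp: {text!r}")


with_ms = parse_gbif_timestamp("2024-12-02T10:16:26.190Z")  # 1st sample row above
without_ms = parse_gbif_timestamp("2024-12-02T10:16:26Z")   # hypothetical 20-character value

print(with_ms.isoformat(), without_ms.isoformat())
```

Parsing into datetimes before analysis avoids treating the column as free text, which is what produces the character-level tables that follow.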
| Value | Count | Frequency (%) |
| 2024-12-02t10:17:03.880z | 100 | < 0.1% |
| 2024-12-02t10:17:08.512z | 92 | < 0.1% |
| 2024-12-02t10:17:04.870z | 87 | < 0.1% |
| 2024-12-02t10:17:05.654z | 87 | < 0.1% |
| 2024-12-02t10:16:52.136z | 85 | < 0.1% |
| 2024-12-02t10:16:59.768z | 85 | < 0.1% |
| 2024-12-02t10:17:07.114z | 85 | < 0.1% |
| 2024-12-02t10:16:58.778z | 84 | < 0.1% |
| 2024-12-02t10:17:03.172z | 84 | < 0.1% |
| 2024-12-02t10:17:08.976z | 83 | < 0.1% |
| Other values (37848) | 723636 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 3187397 | |
| 0 | 2663851 | |
| 1 | 2462647 | |
| - | 1449016 | |
| : | 1449016 | |
| 4 | 1185381 | 6.8% |
| 6 | 810755 | 4.7% |
| T | 724508 | 4.2% |
| Z | 724508 | 4.2% |
| . | 723640 | 4.2% |
| Other values (5) | 2004001 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12314032 | |
| Other Punctuation | 2172656 | 12.5% |
| Dash Punctuation | 1449016 | 8.3% |
| Uppercase Letter | 1449016 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3187397 | |
| 0 | 2663851 | |
| 1 | 2462647 | |
| 4 | 1185381 | 9.6% |
| 6 | 810755 | 6.6% |
| 7 | 497480 | 4.0% |
| 5 | 488259 | 4.0% |
| 3 | 420464 | 3.4% |
| 9 | 301334 | 2.4% |
| 8 | 296464 | 2.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1449016 | |
| . | 723640 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 724508 | |
| Z | 724508 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1449016 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15935704 | |
| Latin | 1449016 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 3187397 | |
| 0 | 2663851 | |
| 1 | 2462647 | |
| - | 1449016 | |
| : | 1449016 | |
| 4 | 1185381 | 7.4% |
| 6 | 810755 | 5.1% |
| . | 723640 | 4.5% |
| 7 | 497480 | 3.1% |
| 5 | 488259 | 3.1% |
| Other values (3) | 1018262 | 6.4% |
Latin
| Value | Count | Frequency (%) |
| T | 724508 | |
| Z | 724508 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17384720 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 3187397 | |
| 0 | 2663851 | |
| 1 | 2462647 | |
| - | 1449016 | |
| : | 1449016 | |
| 4 | 1185381 | 6.8% |
| 6 | 810755 | 4.7% |
| T | 724508 | 4.2% |
| Z | 724508 | 4.2% |
| . | 723640 | 4.2% |
| Other values (5) | 2004001 |
lastCrawled
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 24 |
| Min length | 24 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2024-12-02T10:02:33.848Z |
|---|---|
| 2nd row | 2024-12-02T10:02:33.848Z |
| 3rd row | 2024-12-02T10:02:33.848Z |
| 4th row | 2024-12-02T10:02:33.848Z |
| 5th row | 2024-12-02T10:02:33.848Z |
| Value | Count | Frequency (%) |
| 2024-12-02t10:02:33.848z | 724508 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 3622540 | |
| 0 | 2898032 | |
| 4 | 1449016 | 8.3% |
| - | 1449016 | 8.3% |
| 1 | 1449016 | 8.3% |
| : | 1449016 | 8.3% |
| 3 | 1449016 | 8.3% |
| 8 | 1449016 | 8.3% |
| T | 724508 | 4.2% |
| . | 724508 | 4.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12316636 | |
| Other Punctuation | 2173524 | 12.5% |
| Dash Punctuation | 1449016 | 8.3% |
| Uppercase Letter | 1449016 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3622540 | |
| 0 | 2898032 | |
| 4 | 1449016 | 11.8% |
| 1 | 1449016 | 11.8% |
| 3 | 1449016 | 11.8% |
| 8 | 1449016 | 11.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1449016 | |
| . | 724508 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 724508 | |
| Z | 724508 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1449016 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15939176 | |
| Latin | 1449016 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 3622540 | |
| 0 | 2898032 | |
| 4 | 1449016 | 9.1% |
| - | 1449016 | 9.1% |
| 1 | 1449016 | 9.1% |
| : | 1449016 | 9.1% |
| 3 | 1449016 | 9.1% |
| 8 | 1449016 | 9.1% |
| . | 724508 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| T | 724508 | |
| Z | 724508 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17388192 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 3622540 | |
| 0 | 2898032 | |
| 4 | 1449016 | 8.3% |
| - | 1449016 | 8.3% |
| 1 | 1449016 | 8.3% |
| : | 1449016 | 8.3% |
| 3 | 1449016 | 8.3% |
| 8 | 1449016 | 8.3% |
| T | 724508 | 4.2% |
| . | 724508 | 4.2% |
repatriated
Boolean
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 158317 |
| Missing (%) | 21.9% |
| Memory size | 5.5 MiB |
| Value | Count | Frequency (%) |
| False | 428942 | 59.2% |
| True | 137249 | 18.9% |
| (Missing) | 158317 | 21.9% |
isSequenced
Boolean
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 707.7 KiB |
| Value | Count | Frequency (%) |
| False | 724508 |
gbifRegion
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 160612 |
| Missing (%) | 22.2% |
| Memory size | 5.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 12.4128545 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | AFRICA |
| 3rd row | NORTH_AMERICA |
| 4th row | LATIN_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 468544 | 83.1% |
| latin_america | 47663 | 8.5% |
| europe | 16154 | 2.9% |
| asia | 10382 | 1.8% |
| oceania | 9334 | 1.7% |
| africa | 8278 | 1.5% |
| antarctica | 3541 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1146688 | |
| R | 1012724 | |
| I | 595405 | |
| E | 557849 | |
| C | 540901 | |
| N | 529082 | |
| T | 523289 | |
| _ | 516207 | |
| M | 516207 | |
| O | 494032 | |
| Other values (6) | 567175 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6483352 | |
| Connector Punctuation | 516207 | 7.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1146688 | |
| R | 1012724 | |
| I | 595405 | |
| E | 557849 | |
| C | 540901 | |
| N | 529082 | |
| T | 523289 | |
| M | 516207 | |
| O | 494032 | |
| H | 468544 | |
| Other values (5) | 98631 | 1.5% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 516207 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6483352 | |
| Common | 516207 | 7.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1146688 | |
| R | 1012724 | |
| I | 595405 | |
| E | 557849 | |
| C | 540901 | |
| N | 529082 | |
| T | 523289 | |
| M | 516207 | |
| O | 494032 | |
| H | 468544 | |
| Other values (5) | 98631 | 1.5% |
Common
| Value | Count | Frequency (%) |
| _ | 516207 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6999559 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 1146688 | |
| R | 1012724 | |
| I | 595405 | |
| E | 557849 | |
| C | 540901 | |
| N | 529082 | |
| T | 523289 | |
| _ | 516207 | |
| M | 516207 | |
| O | 494032 | |
| Other values (6) | 567175 |
publishedByGbifRegion
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 724508 |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 1449016 | |
| A | 1449016 | |
| N | 724508 | |
| O | 724508 | |
| T | 724508 | |
| H | 724508 | |
| _ | 724508 | |
| M | 724508 | |
| E | 724508 | |
| I | 724508 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 8694096 | |
| Connector Punctuation | 724508 | 7.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 1449016 | |
| A | 1449016 | |
| N | 724508 | |
| O | 724508 | |
| T | 724508 | |
| H | 724508 | |
| M | 724508 | |
| E | 724508 | |
| I | 724508 | |
| C | 724508 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 724508 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8694096 | |
| Common | 724508 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 1449016 | |
| A | 1449016 | |
| N | 724508 | |
| O | 724508 | |
| T | 724508 | |
| H | 724508 | |
| M | 724508 | |
| E | 724508 | |
| I | 724508 | |
| C | 724508 |
Common
| Value | Count | Frequency (%) |
| _ | 724508 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9418604 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 1449016 | |
| A | 1449016 | |
| N | 724508 | |
| O | 724508 | |
| T | 724508 | |
| H | 724508 | |
| _ | 724508 | |
| M | 724508 | |
| E | 724508 | |
| I | 724508 |
level0Gid
Text
Missing 
| Distinct | 88 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 686240 |
| Missing (%) | 94.7% |
| Memory size | 5.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 17 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | USA |
|---|---|
| 2nd row | USA |
| 3rd row | USA |
| 4th row | USA |
| 5th row | USA |
| Value | Count | Frequency (%) |
| usa | 33578 | |
| mex | 743 | 1.9% |
| can | 398 | 1.0% |
| gum | 255 | 0.7% |
| mnp | 228 | 0.6% |
| pan | 217 | 0.6% |
| idn | 210 | 0.5% |
| umi | 206 | 0.5% |
| fra | 198 | 0.5% |
| pak | 155 | 0.4% |
| Other values (78) | 2080 | 5.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 35038 | |
| U | 34448 | |
| S | 33870 | |
| M | 1679 | 1.5% |
| N | 1312 | 1.1% |
| E | 1296 | 1.1% |
| P | 945 | 0.8% |
| I | 802 | 0.7% |
| X | 743 | 0.6% |
| R | 715 | 0.6% |
| Other values (15) | 3956 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 114804 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 35038 | |
| U | 34448 | |
| S | 33870 | |
| M | 1679 | 1.5% |
| N | 1312 | 1.1% |
| E | 1296 | 1.1% |
| P | 945 | 0.8% |
| I | 802 | 0.7% |
| X | 743 | 0.6% |
| R | 715 | 0.6% |
| Other values (15) | 3956 | 3.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 114804 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 35038 | |
| U | 34448 | |
| S | 33870 | |
| M | 1679 | 1.5% |
| N | 1312 | 1.1% |
| E | 1296 | 1.1% |
| P | 945 | 0.8% |
| I | 802 | 0.7% |
| X | 743 | 0.6% |
| R | 715 | 0.6% |
| Other values (15) | 3956 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 114804 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 35038 | |
| U | 34448 | |
| S | 33870 | |
| M | 1679 | 1.5% |
| N | 1312 | 1.1% |
| E | 1296 | 1.1% |
| P | 945 | 0.8% |
| I | 802 | 0.7% |
| X | 743 | 0.6% |
| R | 715 | 0.6% |
| Other values (15) | 3956 | 3.4% |
level0Name
Text
Missing 
| Distinct | 88 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 686240 |
| Missing (%) | 94.7% |
| Memory size | 5.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 13 |
| Mean length | 12.50533082 |
| Min length | 4 |
Unique
| Unique | 17 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | United States |
|---|---|
| 2nd row | United States |
| 3rd row | United States |
| 4th row | United States |
| 5th row | United States |
| Value | Count | Frequency (%) |
| united | 33879 | |
| states | 33784 | |
| méxico | 743 | 1.0% |
| canada | 398 | 0.5% |
| islands | 291 | 0.4% |
| guam | 255 | 0.3% |
| northern | 235 | 0.3% |
| mariana | 228 | 0.3% |
| panama | 217 | 0.3% |
| indonesia | 210 | 0.3% |
| Other values (93) | 3333 | 4.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 102628 | |
| e | 69101 | |
| a | 39466 | 8.2% |
| n | 37290 | 7.8% |
| i | 37192 | 7.8% |
| (space) | 35305 | 7.4% |
| s | 35286 | 7.4% |
| d | 35157 | 7.3% |
| S | 34062 | 7.1% |
| U | 33936 | 7.1% |
| Other values (43) | 19131 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 369454 | |
| Uppercase Letter | 73623 | 15.4% |
| Space Separator | 35305 | 7.4% |
| Other Punctuation | 172 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 102628 | |
| e | 69101 | |
| a | 39466 | 10.7% |
| n | 37290 | 10.1% |
| i | 37192 | 10.1% |
| s | 35286 | 9.6% |
| d | 35157 | 9.5% |
| o | 2317 | 0.6% |
| r | 1965 | 0.5% |
| c | 1490 | 0.4% |
| Other values (16) | 7562 | 2.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 34062 | |
| U | 33936 | |
| M | 1323 | 1.8% |
| I | 913 | 1.2% |
| P | 587 | 0.8% |
| C | 586 | 0.8% |
| G | 328 | 0.4% |
| N | 301 | 0.4% |
| E | 210 | 0.3% |
| O | 206 | 0.3% |
| Other values (13) | 1171 | 1.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 114 | |
| , | 57 | |
| ' | 1 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 35305 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 443077 | |
| Common | 35477 | 7.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 102628 | |
| e | 69101 | |
| a | 39466 | 8.9% |
| n | 37290 | 8.4% |
| i | 37192 | 8.4% |
| s | 35286 | 8.0% |
| d | 35157 | 7.9% |
| S | 34062 | 7.7% |
| U | 33936 | 7.7% |
| o | 2317 | 0.5% |
| Other values (39) | 16642 | 3.8% |
Common
| Value | Count | Frequency (%) |
| (space) | 35305 | |
| . | 114 | 0.3% |
| , | 57 | 0.2% |
| ' | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 477810 | |
| None | 744 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 102628 | |
| e | 69101 | |
| a | 39466 | 8.3% |
| n | 37290 | 7.8% |
| i | 37192 | 7.8% |
| (space) | 35305 | 7.4% |
| s | 35286 | 7.4% |
| d | 35157 | 7.4% |
| S | 34062 | 7.1% |
| U | 33936 | 7.1% |
| Other values (41) | 18387 | 3.8% |
None
| Value | Count | Frequency (%) |
| é | 743 | |
| ô | 1 | 0.1% |
level1Gid
Text
Missing 
| Distinct | 353 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 686243 |
| Missing (%) | 94.7% |
| Memory size | 5.5 MiB |
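The percentages in the table above appear to use two different denominators: the missing share fits a division by all 724508 observations, while the distinct share (and the unique share reported further down) fits a division by the non-missing values only. A minimal Python sketch of that arithmetic, assuming this reading is correct; the function name `profile_percentages` and its parameters are illustrative and not part of the profiling tool:

```python
def profile_percentages(n_rows, n_missing, n_distinct, n_unique):
    """Recompute the summary percentages from raw counts.

    Assumption: missing % is relative to all rows, while distinct %
    and unique % are relative to the non-missing rows only.
    """
    n_present = n_rows - n_missing
    return {
        "missing_pct": round(100 * n_missing / n_rows, 1),
        "distinct_pct": round(100 * n_distinct / n_present, 1),
        "unique_pct": round(100 * n_unique / n_present, 1),
    }


# Counts taken from this report: 724508 observations overall,
# 686243 missing, 353 distinct and 98 unique values for level1Gid.
stats = profile_percentages(724508, 686243, 353, 98)
print(stats)
```

With these inputs the sketch reproduces the 94.7%, 0.9% and 0.3% shown for this variable.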
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.803449628 |
| Min length | 7 |
Unique
| Unique | 98 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | USA.10_1 |
|---|---|
| 2nd row | USA.29_1 |
| 3rd row | USA.2_1 |
| 4th row | USA.44_1 |
| 5th row | USA.38_1 |
| Value | Count | Frequency (%) |
| usa.44_1 | 3802 | 9.9% |
| usa.38_1 | 2959 | 7.7% |
| usa.23_1 | 2129 | 5.6% |
| usa.34_1 | 2095 | 5.5% |
| usa.10_1 | 1141 | 3.0% |
| usa.17_1 | 1123 | 2.9% |
| usa.32_1 | 1117 | 2.9% |
| usa.18_1 | 1042 | 2.7% |
| usa.1_1 | 1017 | 2.7% |
| usa.2_1 | 983 | 2.6% |
| Other values (343) | 20857 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 48709 | |
| . | 38265 | |
| _ | 38265 | |
| A | 35032 | |
| U | 34448 | |
| S | 33870 | |
| 4 | 15952 | 5.3% |
| 3 | 14650 | 4.9% |
| 2 | 8837 | 3.0% |
| 8 | 5433 | 1.8% |
| Other values (27) | 25138 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 114795 | |
| Decimal Number | 107274 | |
| Other Punctuation | 38265 | 12.8% |
| Connector Punctuation | 38265 | 12.8% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 35032 | |
| U | 34448 | |
| S | 33870 | |
| M | 1679 | 1.5% |
| N | 1312 | 1.1% |
| E | 1296 | 1.1% |
| P | 945 | 0.8% |
| I | 802 | 0.7% |
| X | 743 | 0.6% |
| R | 715 | 0.6% |
| Other values (15) | 3953 | 3.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 48709 | |
| 4 | 15952 | 14.9% |
| 3 | 14650 | 13.7% |
| 2 | 8837 | 8.2% |
| 8 | 5433 | 5.1% |
| 5 | 3413 | 3.2% |
| 7 | 3029 | 2.8% |
| 6 | 2776 | 2.6% |
| 0 | 2416 | 2.3% |
| 9 | 2059 | 1.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 38265 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 38265 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 183804 | |
| Latin | 114795 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 35032 | |
| U | 34448 | |
| S | 33870 | |
| M | 1679 | 1.5% |
| N | 1312 | 1.1% |
| E | 1296 | 1.1% |
| P | 945 | 0.8% |
| I | 802 | 0.7% |
| X | 743 | 0.6% |
| R | 715 | 0.6% |
| Other values (15) | 3953 | 3.4% |
Common
| Value | Count | Frequency (%) |
| 1 | 48709 | |
| . | 38265 | |
| _ | 38265 | |
| 4 | 15952 | 8.7% |
| 3 | 14650 | 8.0% |
| 2 | 8837 | 4.8% |
| 8 | 5433 | 3.0% |
| 5 | 3413 | 1.9% |
| 7 | 3029 | 1.6% |
| 6 | 2776 | 1.5% |
| Other values (2) | 4475 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 298599 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 48709 | |
| . | 38265 | |
| _ | 38265 | |
| A | 35032 | |
| U | 34448 | |
| S | 33870 | |
| 4 | 15952 | 5.3% |
| 3 | 14650 | 4.9% |
| 2 | 8837 | 3.0% |
| 8 | 5433 | 1.8% |
| Other values (27) | 25138 |
level1Name
Text
Missing 
| Distinct | 353 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 686243 |
| Missing (%) | 94.7% |
| Memory size | 5.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 29 |
| Mean length | 8.062981837 |
| Min length | 3 |
Unique
| Unique | 98 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Florida |
|---|---|
| 2nd row | Nevada |
| 3rd row | Alaska |
| 4th row | Texas |
| 5th row | Oregon |
| Value | Count | Frequency (%) |
| texas | 3802 | 8.4% |
| oregon | 2959 | 6.5% |
| carolina | 2734 | 6.0% |
| new | 2376 | 5.2% |
| michigan | 2129 | 4.7% |
| north | 2102 | 4.6% |
| florida | 1141 | 2.5% |
| kansas | 1123 | 2.5% |
| mexico | 1117 | 2.5% |
| kentucky | 1042 | 2.3% |
| Other values (409) | 24889 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 41520 | 13.5% |
| i | 26049 | 8.4% |
| o | 23918 | 7.8% |
| n | 23868 | 7.7% |
| e | 19743 | 6.4% |
| r | 18679 | 6.1% |
| s | 17076 | 5.5% |
| l | 13026 | 4.2% |
| h | 9794 | 3.2% |
| t | 9186 | 3.0% |
| Other values (71) | 105671 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 254467 | |
| Uppercase Letter | 45849 | 14.9% |
| Space Separator | 7149 | 2.3% |
| Dash Punctuation | 932 | 0.3% |
| Other Punctuation | 132 | < 0.1% |
| Modifier Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 41520 | |
| i | 26049 | |
| o | 23918 | |
| n | 23868 | |
| e | 19743 | 7.8% |
| r | 18679 | 7.3% |
| s | 17076 | 6.7% |
| l | 13026 | 5.1% |
| h | 9794 | 3.8% |
| t | 9186 | 3.6% |
| Other values (37) | 51608 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 6252 | |
| N | 5334 | |
| C | 5061 | |
| O | 4909 | |
| T | 4580 | |
| A | 3206 | 7.0% |
| K | 2441 | 5.3% |
| W | 2083 | 4.5% |
| V | 1707 | 3.7% |
| S | 1669 | 3.6% |
| Other values (19) | 8607 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 130 | |
| / | 2 | 1.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 7149 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 932 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 300316 | |
| Common | 8214 | 2.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 41520 | |
| i | 26049 | 8.7% |
| o | 23918 | 8.0% |
| n | 23868 | 7.9% |
| e | 19743 | 6.6% |
| r | 18679 | 6.2% |
| s | 17076 | 5.7% |
| l | 13026 | 4.3% |
| h | 9794 | 3.3% |
| t | 9186 | 3.1% |
| Other values (66) | 97457 |
Common
| Value | Count | Frequency (%) |
| (space) | 7149 | |
| - | 932 | 11.3% |
| ' | 130 | 1.6% |
| / | 2 | < 0.1% |
| ` | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 307555 | |
| None | 975 | 0.3% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 41520 | 13.5% |
| i | 26049 | 8.5% |
| o | 23918 | 7.8% |
| n | 23868 | 7.8% |
| e | 19743 | 6.4% |
| r | 18679 | 6.1% |
| s | 17076 | 5.6% |
| l | 13026 | 4.2% |
| h | 9794 | 3.2% |
| t | 9186 | 3.0% |
| Other values (46) | 104696 |
None
| Value | Count | Frequency (%) |
| ó | 226 | |
| á | 167 | |
| é | 142 | |
| ý | 124 | |
| í | 96 | |
| ñ | 52 | 5.3% |
| Î | 36 | 3.7% |
| š | 25 | 2.6% |
| ô | 21 | 2.2% |
| ę | 15 | 1.5% |
| Other values (15) | 71 | 7.3% |
level2Gid
Text
Missing 
| Distinct | 1562 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 687320 |
| Missing (%) | 94.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 10.68914704 |
| Min length | 9 |
Unique
| Unique | 384 |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | USA.10.3_1 |
|---|---|
| 2nd row | USA.29.10_1 |
| 3rd row | USA.2.17_1 |
| 4th row | USA.44.57_1 |
| 5th row | USA.38.21_1 |
| Value | Count | Frequency (%) |
| usa.23.44_1 | 1758 | 4.7% |
| usa.38.21_1 | 1751 | 4.7% |
| mex.30.91_2 | 673 | 1.8% |
| usa.36.44_1 | 428 | 1.2% |
| usa.8.2_1 | 412 | 1.1% |
| usa.41.8_1 | 377 | 1.0% |
| usa.2.17_1 | 366 | 1.0% |
| usa.44.22_1 | 329 | 0.9% |
| usa.44.252_1 | 321 | 0.9% |
| usa.32.31_1 | 307 | 0.8% |
| Other values (1552) | 30466 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 74376 | |
| 1 | 61042 | |
| _ | 37188 | |
| A | 34984 | |
| U | 33973 | |
| S | 33842 | |
| 4 | 26872 | 6.8% |
| 2 | 21885 | 5.5% |
| 3 | 20338 | 5.1% |
| 8 | 8756 | 2.2% |
| Other values (27) | 44252 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 174380 | |
| Uppercase Letter | 111564 | |
| Other Punctuation | 74376 | |
| Connector Punctuation | 37188 | 9.4% |
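The category labels in the table above match the long names of Unicode general categories. As a rough illustration of how such a tally could be produced, the sketch below counts characters by category with Python's standard `unicodedata` module. It assumes the report's categories correspond to the Unicode general categories; the helper `category_counts` and the `CATEGORY_NAMES` mapping are hypothetical and do not describe how the profiling tool is implemented.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes seen in these columns
# (illustrative subset; other codes are reported unchanged).
CATEGORY_NAMES = {
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Po": "Other Punctuation",
    "Pc": "Connector Punctuation",
    "Zs": "Space Separator",
}


def category_counts(values):
    """Count characters across all strings, grouped by Unicode category."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# The five sample rows shown for level2Gid.
sample = ["USA.10.3_1", "USA.29.10_1", "USA.2.17_1", "USA.44.57_1", "USA.38.21_1"]
counts = category_counts(sample)
for name, n in counts.most_common():
    print(name, n)
```

On the five sample rows the digits come out as the largest group, followed by uppercase letters, which is the same ordering as in the full-column table.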
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 34984 | |
| U | 33973 | |
| S | 33842 | |
| E | 1269 | 1.1% |
| N | 1077 | 1.0% |
| M | 923 | 0.8% |
| X | 743 | 0.7% |
| C | 674 | 0.6% |
| P | 593 | 0.5% |
| R | 571 | 0.5% |
| Other values (15) | 2915 | 2.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 61042 | |
| 4 | 26872 | |
| 2 | 21885 | 12.6% |
| 3 | 20338 | 11.7% |
| 8 | 8756 | 5.0% |
| 5 | 8670 | 5.0% |
| 7 | 7987 | 4.6% |
| 6 | 7073 | 4.1% |
| 9 | 6011 | 3.4% |
| 0 | 5746 | 3.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 74376 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 37188 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 285944 | |
| Latin | 111564 | 28.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 34984 | |
| U | 33973 | |
| S | 33842 | |
| E | 1269 | 1.1% |
| N | 1077 | 1.0% |
| M | 923 | 0.8% |
| X | 743 | 0.7% |
| C | 674 | 0.6% |
| P | 593 | 0.5% |
| R | 571 | 0.5% |
| Other values (15) | 2915 | 2.6% |
Common
| Value | Count | Frequency (%) |
| . | 74376 | |
| 1 | 61042 | |
| _ | 37188 | |
| 4 | 26872 | 9.4% |
| 2 | 21885 | 7.7% |
| 3 | 20338 | 7.1% |
| 8 | 8756 | 3.1% |
| 5 | 8670 | 3.0% |
| 7 | 7987 | 2.8% |
| 6 | 7073 | 2.5% |
| Other values (2) | 11757 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 397508 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 74376 | |
| 1 | 61042 | |
| _ | 37188 | |
| A | 34984 | |
| U | 33973 | |
| S | 33842 | |
| 4 | 26872 | 6.8% |
| 2 | 21885 | 5.5% |
| 3 | 20338 | 5.1% |
| 8 | 8756 | 2.2% |
| Other values (27) | 44252 |
level2Name
Text
Missing 
| Distinct | 1254 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 687320 |
| Missing (%) | 94.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 25 |
| Mean length | 7.870119393 |
| Min length | 3 |
Unique
| Unique | 303 |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | Bay |
|---|---|
| 2nd row | Lincoln |
| 3rd row | North Slope |
| 4th row | Dallas |
| 5th row | Lincoln |
| Value | Count | Frequency (%) |
| lake | 3351 | 7.3% |
| hurron | 1795 | 3.9% |
| lincoln | 1776 | 3.9% |
| superior | 694 | 1.5% |
| jesús | 673 | 1.5% |
| carranza | 673 | 1.5% |
| washington | 612 | 1.3% |
| new | 537 | 1.2% |
| san | 534 | 1.2% |
| erie | 465 | 1.0% |
| Other values (1364) | 34807 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 29477 | 10.1% |
| e | 27848 | 9.5% |
| r | 23544 | 8.0% |
| n | 23216 | 7.9% |
| o | 21878 | 7.5% |
| l | 16497 | 5.6% |
| i | 15387 | 5.3% |
| t | 11258 | 3.8% |
| s | 11244 | 3.8% |
| u | 8940 | 3.1% |
| Other values (75) | 103385 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 236559 | |
| Uppercase Letter | 46425 | 15.9% |
| Space Separator | 8729 | 3.0% |
| Dash Punctuation | 537 | 0.2% |
| Other Punctuation | 402 | 0.1% |
| Open Punctuation | 10 | < 0.1% |
| Close Punctuation | 10 | < 0.1% |
| Decimal Number | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 29477 | |
| e | 27848 | |
| r | 23544 | |
| n | 23216 | |
| o | 21878 | |
| l | 16497 | 7.0% |
| i | 15387 | 6.5% |
| t | 11258 | 4.8% |
| s | 11244 | 4.8% |
| u | 8940 | 3.8% |
| Other values (38) | 47270 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 7061 | |
| C | 6706 | |
| S | 4155 | 8.9% |
| B | 3487 | 7.5% |
| H | 3264 | 7.0% |
| M | 2112 | 4.5% |
| P | 2058 | 4.4% |
| W | 2005 | 4.3% |
| T | 1666 | 3.6% |
| D | 1665 | 3.6% |
| Other values (17) | 12246 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 333 | |
| / | 47 | 11.7% |
| . | 21 | 5.2% |
| , | 1 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 3 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 8729 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 537 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 10 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 282984 | |
| Common | 9690 | 3.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 29477 | 10.4% |
| e | 27848 | 9.8% |
| r | 23544 | 8.3% |
| n | 23216 | 8.2% |
| o | 21878 | 7.7% |
| l | 16497 | 5.8% |
| i | 15387 | 5.4% |
| t | 11258 | 4.0% |
| s | 11244 | 4.0% |
| u | 8940 | 3.2% |
| Other values (65) | 93695 |
Common
| Value | Count | Frequency (%) |
| (space) | 8729 | |
| - | 537 | 5.5% |
| ' | 333 | 3.4% |
| / | 47 | 0.5% |
| . | 21 | 0.2% |
| ( | 10 | 0.1% |
| ) | 10 | 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| , | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 291303 | |
| None | 1371 | 0.5% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 29477 | 10.1% |
| e | 27848 | 9.6% |
| r | 23544 | 8.1% |
| n | 23216 | 8.0% |
| o | 21878 | 7.5% |
| l | 16497 | 5.7% |
| i | 15387 | 5.3% |
| t | 11258 | 3.9% |
| s | 11244 | 3.9% |
| u | 8940 | 3.1% |
| Other values (51) | 102014 |
None
| Value | Count | Frequency (%) |
| ú | 673 | |
| ó | 328 | |
| é | 112 | 8.2% |
| í | 101 | 7.4% |
| š | 26 | 1.9% |
| á | 25 | 1.8% |
| è | 22 | 1.6% |
| ř | 20 | 1.5% |
| ü | 14 | 1.0% |
| ô | 9 | 0.7% |
| Other values (14) | 41 | 3.0% |
level3Gid
Text
Missing 
| Distinct | 340 |
|---|---|
| Distinct (%) | 17.0% |
| Missing | 722506 |
| Missing (%) | 99.7% |
| Memory size | 5.5 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 13 |
| Mean length | 11.82917083 |
| Min length | 11 |
Unique
| Unique | 159 |
|---|---|
| Unique (%) | 7.9% |
Sample
| 1st row | IDN.34.7.16_1 |
|---|---|
| 2nd row | MMR.7.4.6_1 |
| 3rd row | POL.15.20.6_1 |
| 4th row | PAK.7.8.3_1 |
| 5th row | ESP.17.1.4_1 |
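The sample identifiers above share one visible shape: a three-letter country code, one dotted number per administrative level, and a numeric suffix after an underscore. The sketch below splits an identifier into those parts. The pattern is inferred only from the sample rows in this report, so identifiers with a different shape would be rejected; the names `Gid` and `parse_gid` are hypothetical.

```python
import re
from typing import NamedTuple, Tuple


class Gid(NamedTuple):
    country: str             # three-letter country code, e.g. "IDN"
    levels: Tuple[int, ...]  # one index per administrative level
    version: int             # numeric suffix after the underscore


# Assumed shape: CCC.n[.n[.n]]_v, based on the sample rows only.
_GID_RE = re.compile(r"^([A-Z]{3})((?:\.\d+)+)_(\d+)$")


def parse_gid(gid: str) -> Gid:
    """Split an identifier such as 'IDN.34.7.16_1' into its parts."""
    match = _GID_RE.match(gid)
    if match is None:
        raise ValueError(f"unrecognised GID: {gid!r}")
    country, dotted, version = match.groups()
    # dotted looks like ".34.7.16"; drop the empty piece before the first dot.
    levels = tuple(int(part) for part in dotted.split(".")[1:])
    return Gid(country, levels, int(version))


print(parse_gid("IDN.34.7.16_1"))
print(parse_gid("USA.10_1"))
```

Under this reading, the number of entries in `levels` gives the administrative depth, so the level1Gid, level2Gid and level3Gid columns would carry one, two and three indices respectively.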
| Value | Count | Frequency (%) |
| pan.4.2.2_1 | 216 | 10.8% |
| idn.34.7.16_1 | 162 | 8.1% |
| ecu.9.2.2_1 | 82 | 4.1% |
| can.9.24.1_1 | 79 | 3.9% |
| pak.7.8.3_1 | 59 | 2.9% |
| can.8.1.2_1 | 56 | 2.8% |
| mar.4.2.10_1 | 41 | 2.0% |
| can.9.22.1_1 | 37 | 1.8% |
| can.9.32.5_1 | 30 | 1.5% |
| can.9.23.1_1 | 30 | 1.5% |
| Other values (330) | 1210 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 6006 | |
| 1 | 3975 | |
| _ | 2002 | 8.5% |
| 2 | 1598 | 6.7% |
| A | 1231 | 5.2% |
| 4 | 921 | 3.9% |
| N | 838 | 3.5% |
| 3 | 797 | 3.4% |
| P | 565 | 2.4% |
| C | 518 | 2.2% |
| Other values (23) | 5231 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9668 | |
| Other Punctuation | 6006 | |
| Uppercase Letter | 6006 | |
| Connector Punctuation | 2002 | 8.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1231 | |
| N | 838 | |
| P | 565 | |
| C | 518 | |
| R | 411 | 6.8% |
| I | 371 | 6.2% |
| E | 307 | 5.1% |
| D | 302 | 5.0% |
| F | 198 | 3.3% |
| T | 165 | 2.7% |
| Other values (11) | 1100 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3975 | |
| 2 | 1598 | |
| 4 | 921 | 9.5% |
| 3 | 797 | 8.2% |
| 9 | 485 | 5.0% |
| 7 | 479 | 5.0% |
| 5 | 449 | 4.6% |
| 6 | 405 | 4.2% |
| 8 | 337 | 3.5% |
| 0 | 222 | 2.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6006 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2002 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17676 | |
| Latin | 6006 | 25.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1231 | |
| N | 838 | |
| P | 565 | |
| C | 518 | |
| R | 411 | 6.8% |
| I | 371 | 6.2% |
| E | 307 | 5.1% |
| D | 302 | 5.0% |
| F | 198 | 3.3% |
| T | 165 | 2.7% |
| Other values (11) | 1100 |
Common
| Value | Count | Frequency (%) |
| . | 6006 | |
| 1 | 3975 | |
| _ | 2002 | 11.3% |
| 2 | 1598 | 9.0% |
| 4 | 921 | 5.2% |
| 3 | 797 | 4.5% |
| 9 | 485 | 2.7% |
| 7 | 479 | 2.7% |
| 5 | 449 | 2.5% |
| 6 | 405 | 2.3% |
| Other values (2) | 559 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23682 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 6006 | |
| 1 | 3975 | |
| _ | 2002 | 8.5% |
| 2 | 1598 | 6.7% |
| A | 1231 | 5.2% |
| 4 | 921 | 3.9% |
| N | 838 | 3.5% |
| 3 | 797 | 3.4% |
| P | 565 | 2.4% |
| C | 518 | 2.2% |
| Other values (23) | 5231 |
level3Name
Text
Missing 
| Distinct | 340 |
|---|---|
| Distinct (%) | 17.0% |
| Missing | 722506 |
| Missing (%) | 99.7% |
| Memory size | 5.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 24 |
| Mean length | 11.58741259 |
| Min length | 3 |
Unique
| Unique | 159 |
|---|---|
| Unique (%) | 7.9% |
Sample
| 1st row | Sangkulirang |
|---|---|
| 2nd row | Thayet |
| 3rd row | Raszków |
| 4th row | Mianwali |
| 5th row | n.a. (108) |
| Value | Count | Frequency (%) |
| barrio | 216 | 6.2% |
| sur | 216 | 6.2% |
| lake | 172 | 5.0% |
| sangkulirang | 162 | 4.7% |
| santa | 84 | 2.4% |
| cab | 82 | 2.4% |
| n.a | 82 | 2.4% |
| floreana | 82 | 2.4% |
| isla | 82 | 2.4% |
| mara | 82 | 2.4% |
| Other values (426) | 2205 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3242 | 14.0% |
| r | 2007 | 8.7% |
| n | 1526 | 6.6% |
| i | 1489 | 6.4% |
| (space) | 1463 | 6.3% |
| e | 1431 | 6.2% |
| o | 1107 | 4.8% |
| u | 917 | 4.0% |
| l | 907 | 3.9% |
| S | 747 | 3.2% |
| Other values (69) | 8362 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17193 | |
| Uppercase Letter | 3353 | 14.5% |
| Space Separator | 1463 | 6.3% |
| Other Punctuation | 378 | 1.6% |
| Open Punctuation | 286 | 1.2% |
| Decimal Number | 254 | 1.1% |
| Close Punctuation | 204 | 0.9% |
| Dash Punctuation | 67 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3242 | |
| r | 2007 | |
| n | 1526 | |
| i | 1489 | |
| e | 1431 | |
| o | 1107 | 6.4% |
| u | 917 | 5.3% |
| l | 907 | 5.3% |
| t | 616 | 3.6% |
| g | 594 | 3.5% |
| Other values (26) | 3357 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 747 | |
| B | 433 | |
| L | 308 | 9.2% |
| M | 229 | 6.8% |
| C | 190 | 5.7% |
| F | 169 | 5.0% |
| I | 144 | 4.3% |
| K | 137 | 4.1% |
| A | 111 | 3.3% |
| D | 105 | 3.1% |
| Other values (15) | 780 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 65 | |
| 1 | 52 | |
| 6 | 31 | |
| 8 | 25 | 9.8% |
| 0 | 19 | 7.5% |
| 3 | 19 | 7.5% |
| 7 | 17 | 6.7% |
| 5 | 12 | 4.7% |
| 4 | 8 | 3.1% |
| 9 | 6 | 2.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 277 | |
| , | 73 | 19.3% |
| ' | 24 | 6.3% |
| / | 4 | 1.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1463 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 286 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 204 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 67 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20546 | |
| Common | 2652 | 11.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3242 | |
| r | 2007 | 9.8% |
| n | 1526 | 7.4% |
| i | 1489 | 7.2% |
| e | 1431 | 7.0% |
| o | 1107 | 5.4% |
| u | 917 | 4.5% |
| l | 907 | 4.4% |
| S | 747 | 3.6% |
| t | 616 | 3.0% |
| Other values (51) | 6557 |
Common
| Value | Count | Frequency (%) |
| (space) | 1463 | |
| ( | 286 | 10.8% |
| . | 277 | 10.4% |
| ) | 204 | 7.7% |
| , | 73 | 2.8% |
| - | 67 | 2.5% |
| 2 | 65 | 2.5% |
| 1 | 52 | 2.0% |
| 6 | 31 | 1.2% |
| 8 | 25 | 0.9% |
| Other values (8) | 109 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23097 | |
| None | 101 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3242 | 14.0% |
| r | 2007 | 8.7% |
| n | 1526 | 6.6% |
| i | 1489 | 6.4% |
| (space) | 1463 | 6.3% |
| e | 1431 | 6.2% |
| o | 1107 | 4.8% |
| u | 917 | 4.0% |
| l | 907 | 3.9% |
| S | 747 | 3.2% |
| Other values (58) | 8261 |
None
| Value | Count | Frequency (%) |
| é | 28 | |
| è | 21 | |
| É | 19 | |
| ü | 9 | 8.9% |
| ó | 8 | 7.9% |
| í | 7 | 6.9% |
| á | 3 | 3.0% |
| ę | 2 | 2.0% |
| ö | 2 | 2.0% |
| ê | 1 | 1.0% |
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 365809 |
| Missing (%) | 50.5% |
| Memory size | 5.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | LC |
|---|---|
| 2nd row | NE |
| 3rd row | NE |
| 4th row | NE |
| 5th row | NE |
| Value | Count | Frequency (%) |
| ne | 340013 | |
| lc | 7458 | 2.1% |
| cr | 3457 | 1.0% |
| vu | 3162 | 0.9% |
| en | 2012 | 0.6% |
| ex | 1761 | 0.5% |
| nt | 761 | 0.2% |
| dd | 73 | < 0.1% |
| ew | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 343788 | |
| N | 342786 | |
| C | 10915 | 1.5% |
| L | 7458 | 1.0% |
| R | 3457 | 0.5% |
| V | 3162 | 0.4% |
| U | 3162 | 0.4% |
| X | 1761 | 0.2% |
| T | 761 | 0.1% |
| D | 146 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 717398 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 343788 | |
| N | 342786 | |
| C | 10915 | 1.5% |
| L | 7458 | 1.0% |
| R | 3457 | 0.5% |
| V | 3162 | 0.4% |
| U | 3162 | 0.4% |
| X | 1761 | 0.2% |
| T | 761 | 0.1% |
| D | 146 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 717398 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 343788 | |
| N | 342786 | |
| C | 10915 | 1.5% |
| L | 7458 | 1.0% |
| R | 3457 | 0.5% |
| V | 3162 | 0.4% |
| U | 3162 | 0.4% |
| X | 1761 | 0.2% |
| T | 761 | 0.1% |
| D | 146 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 717398 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 343788 | |
| N | 342786 | |
| C | 10915 | 1.5% |
| L | 7458 | 1.0% |
| R | 3457 | 0.5% |
| V | 3162 | 0.4% |
| U | 3162 | 0.4% |
| X | 1761 | 0.2% |
| T | 761 | 0.1% |
| D | 146 | < 0.1% |